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-Et METHODS ™ THE I DENT I F I CAT I ON „\ JemB ^ ^ 

FIELD OF THE INVENTION 

The present invention pertains m a 

5 .o dulating the activity Jn p viv r 0 1 ; ;:r: r c acids capawe - 

cells. More sped f ica , , v fh . ^ e " 2ymeS ln «"kar yot ic 

— mca tio : /prep ;;: ; ;~ « a ™t h od for 

as inhibfr^. - "ithecto unknown enhancer, „ 

-« h .™,™;" t ;: :;: on en r: t :rr ity : n euk " yotic - us - 

10 of unknown interaction, , ^ ide """«tion 

« U fl .nd but ali o h th !' ldM ""<=«"™ »f a target and/or 
— « k„ t ar ; T ° h Unkn0 " n — — >»twe.„ known 
i-MMtor structures " ^ ""^ 

Present potenti.il, biol.,".^ \ ' ^ '"'"""^"ly 

" ^ids in a stable fc™ \T " * ° r ^"""udeic 

as m e th ods for th /£ ~» ™ T™' " 

cells carryina th P „or, „ ■ vectors and transformed 

rying tne genetic information encoding m, 
and targets. Finally th« ■ • en coding these ligands 

- Preparation of a n die lV D T "^^ t0 * ~ *,r the 
identification of ^ " hlCh iS ^ °" initial 

invention. ^ ^ to the present 

BACKGROUND OF THE INVENTION 

peptide sequences having bioloaical ^t-,- 
vivo and is di =^1 • "^logical activity in 

disclosed m WO 96/38553. m short hk • 
random peptides 3 r-~ snort, libraries of 

peptides are expressed intracellular!,, • 
cells, such that one cell exores. 11Ularly ln —alien 
* , ex P^esses one sinnio ^ 

heterologous short peptides Cell, ru 
30 phenotype under cert ° han9e a Preselected 

pe Pt i d : ^ th ; ; e ; ;;: s ; ondit :° ns can be is — - ^ 

ney exp ress can hence be identified. The 
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intraceHuiar component with which f h. 

— ^> «v =-3 equent ; h b irirrr interacts itha 
<* carrying the lmmobili i e ; sy t:: t ";: ;;; n : d i; 9 - — 

intracellular environment is ' relat T"™" Pr ° blem thmt the 
sion products which are expressed T„ * t0 «P«s- 
Peptide or nucleic acid J q ZZ 'whL "° rdS ' lnt «"»'" 

<>< interact^ „i th an i J r "*'"»«»Y are capaMe 

or inactivated inside the cell i? ^ ^ 

can be detected. a " y effect on P 1 * enotype 



OBJECT OF THE INVENTION 



It is an object of the invention ^ 

Cellscreen technology by overco ? Movement, in the 

» of poten tial lnBtab £ * — :: ed the — ned Problems 

is an ob ject of the invention T.^JT""' ' " 

CellScreen technology to also e ^ Utllity ° f the 

cells. alS ° —Pass screening in prokaryotic 



SUMMARY OF THE INVENTION 



20 A signifi cant number of M 

-crobial an d eu k a r ^ tlc c.T ""^ 

below. C611 ° rigin hav * been described, C f. 

Since many of the naturally OCCUrHn 

Ovulated by enzvmes. the LZ 0 ^IT"" ^ ^ 
^ expression of l arg e 1,,,..!,^ 3 

-tivitv .orators in „ hich T ™ 

Ues ln tne active site. This 



. Il r "' M ° f PUt " iVe no ^ 1 '^» "PabX. of modulating 
the ««»..» of an array ot different en2ymes » 

expressing these modulators in cell lines according to , BO J% 
-,»Uo» of the censcee, technology th . J c 
5 mechanisms inside the individual cell ln sald ^ \ ^ 

such as e.g. resistance towards hypoglycemia, cytokine fining 
to xl c compounds, virus infection etc. 

Ihe mam advantage of usin g k „own enzyme activity adulators 
10 such as enzyme inhiblCors as scaffold is that many of these in 

Z"rT£ 0 'r m Sta " e " environment 

"trail - '"°" 0t ' " SCa "° ld = *>* 

TntZT f PreSent " i0 " ° f ""*» P-P"d« 1. that .any such 
IS c Z int TT" - Proteolytic and redu- 

cin, intracellular environment and therefore such molecules can 

a d C ir h T y tail ° red ' mal " tain staMlU ' 

at the same time incorporate random sequences which are 

screened for Molo 9 ical activity. Purth.rmore, the effec rvity 

20 h T t SCree " ""^""V «"™ substances win be 

20 higher than if using an unstable scaffold (such as a e g a 

coiled coti structure, or no scaffold at ,11, since none or only 

71 " Umb " ° f »" d ^" d — s "ill be de 

graded before they can exert their effects in the cells. Fl - 

» m;:: ::? ™r activitv moduiat °- ha - - 

whrch is the perfect position in the molecule to 
-odxty, since the active site is normally presented in a stable 
configuration to the environment. 

Hence, one aspect of the invention pertains to a method 
for identifying an in vivo acclve modulator Qf ^ 

target enzyme, the method comprising the steps of ,a, preparing 

leaSt °" e memb « f ™» - "brary of randomly modified 
nucleotide sequences derived from a parent nucleotide sequence 
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pool so as to obtain tr an ,f , vectors of said 

uotain transformed cells ^-i^t *. 
5 identical cells bei™ substantially 
ceils being ones which harbour the t*™^ 
culturing said transformed cells under c H ^ 
expression of said M h i conditions facilitating 

ot said randomly modified nucleotide c ~ 
examining said transformed cells and 8equenc "' < d > 

cell(s) wherein the activity oJ the t" " 9 ^"^r^ 

- and c, determining said ^ ^ ^tilT 

:e t ::i d n i:;\;7i:;:r a :v ell(s) is — - -vzr 

g the ammo acid sequence or the ribonucleic acid 

able part of the parent nucleotlde ^J^^ " 

nucleotides. In line „ ith th . above , ^ ^ ™ *' 

parent nucleotide sequence prefab lnva "able part of the 

^0 o f the parent peptide or o f ^ ""^"T" 3 ^"^ 
serves to stabilise • „ ribonucleic acid which 

acid rva™ " P — — „t or ribonucleic 



As mentioned above th<= n<=o 

e, the use of enzyme modulator scaffold „i 
provides for expression of stable h ■ i • ° 
25 tors which interact ^ biologically active modula- 

nicn interact with other biomolecules th^ n 
other words in w les than enzymes. In 

r words, in cases where random nucleotides • 
scaffold structure t-K cieotides are inserted in a 

structure, the outcome will be a sf a hi a 

Product, th. activity of „ hlch does not nec s LiLT" 10 " 
thin, to do with e„ 2ym e activity adulation ' ^ 

30 a H :::;ii:n; p ;r f r:: rrr n is a ~ - 

fr.9~.~t or ribonucleic acid f " ^"-P"* 

ucleic acid fragment which is capable of de- 



5 

tectably modulating, in vivo ^ . 

m e thod co mprising the step 7;/ ^ typic trait in a cell# the 

(a) preparing a pool of expression vectors each 
Pool conning at least one from a \Z ^ 

5 modified nucleo tide seances derived from " °* 

sequence encoding a oarers ■ Parent nuc ^otide 

— ln w„ J u ; a ::r;: t r;r ; d :; r r rent ribonucieic ^ 
~ ly modlfled nucleotide se ; uenc : s k ::;; r ;:;- uhecein the 

» ; P ;::r:: b :; r P :;;;rv scaffoid port — 
, »rvi„„ t o staMnL :; d : 0 ; nuciei ° acid - ^ p-t io „ 

1 acid fragment, and P^Ptide fragment or ribonucle . c 

random nucleotides, 

» ^^^^ZlTZr SUbStanti3lly — 1 Ul- 
ceus, d P ° 01 S ° 35 t0 obt -n transformed 

(c) culturing said transformed CP n„ ^ 

ting expression of said J , ^ COnditi — facilita- 

randomly modified nucleotide sequences, 

« ^irrr ceiis and isoiati - — 

- presence of £ ^^T^ ^ * 

sequence, and randomly modified nucleotide 

(e> determining said randomly modified nucWn 
said vector present in cell( S ) isol , S6qUence of 

25 determining the amino aj u * ^ ^ ^ 
* sequence of the exor P „' ribonucleic acid 

scaffold proteins, ribonucleotides, or 
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7 thereof for the presentation c f random seguences 
the CellScreen technology. As mentioned above, although the 
concept of using scaffold molecules has been discussed in the 
prior art, the issue of the stability of the scaffold 
5 not been detailed. system has 

The stability and usefulnp^ a *. • 

, , „ . „ serulness of a putative intracellular scaf- 

fold is dependent on a number of factors. First of all it is 
essential that the relevant cell wherein the scaffold is to be 
expressed is capable of expressing the scaffold molecule in a 
10 functional for,; that is. in pro k aryotic system some eu.aryctic 
pro e ln s Blll not fold correctfy, hence rendering the use of 
such a protein unsuitable as a scaffold in that type of cell. 
Second, the scaffold should be relatively resistant to the 
reducing and catalytic environment inside intact ceHs. However 
15 even when a scaffold molecule is relatively susceptible to the 
inactivating nature of the intracellular environment, this can 
be remedied if the production rate of the scaffold molecule 
sufficiently high. 

in steady state, the intraceliular concentration of a scaffold 
20 „„lecu le will be a function of the foUowing formula: 

C - i 

^scaffold ~ Rj 

--here is the rate of production of the scaffold molecule 
<-les • and p. is the inactivation constant for the scaffold 

-l.=ul. ,1 ■ i.e. the rate of inactivation of the scaffold 

25 molecule is determined by*= (which, in steady state, 

of course equals Rp. in other words, when assessing the suitabi- 
. Uty of a potential scaffold molecule, it should according to 

tepnt invention be tested whether the molecule can be k ept 
at a sufficiently high concentration inside the relevant cell 
30 wherein the CellScreen test is going to be carried out 
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Therefore, . very broad aspect of the inventioh pertains to a 
~thod for ident^ying a viator in the form of a biologically 
actrve polypeptide fragment or ribonucleic acid fragment which 

■ s tzt :i detectabiy - ^ * tr . it 

of a cell, the method comprising the steps of 

(a, preparing a pool of expression vectors, each vector of said 
pool containing at least one member from a li brary of randomly 

modified nucleotide sequences derived fr™ , 

sequence encoding a parent peptide or parent ribonucleic acid 

nucl int "« llu1 "^' ""."in the randomly modified 

nucleotide sequences comprise 

an invariable part encoding a scaffold portion of the 
Parent peptide or of the parent ribonucleic acid, said 
scaffold portion serving to stabilize said polypeptide 
fragment or ribonucleic acid fragment, and 
random nucleotides, 

(b> transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 

cells, 

20 (c, culturing said transformed cells under conditions facilita- 
ting expression of said randomly modified nucleotide sequences, 

(d) examining said transformed cells and isolating transformed 
cell(s) wherem the preselected phenotypic trait is modulated by 
the presence of the expressed randomly modified nucleotide 

^5 sequence, and 

• (e) determining said randomly modified nucleotide sequence of 
^ said vector present in cell (a, isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic acid 
sequence of the expression product encoded by said randomly 

30 modified nucleotide sequence, 
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8 

wherein th . expression product of t „. „ ucleic ac 

wh lch . ncode . tne intraoellularly stable 2 

parent ribonucleic ac i d is one ^ ^ ^ 
substantially identical celis. is present in an effective con 
5 centration and in a functional state. 

sc™; 13 6SSential "^""y Of tne 

scaffo d molecuie is evaluated prior to perform, the steps of 
the Cellscreen technology in order to confirm that the scaffold 
molecule in unmodified form can b. scaffold 
«„„,. . , can be expressed and maintained at „ 

sufficiently high concentration/activity in the cellular , r 
where tne method of the invention is to be mr cV£ " 

TuZTnV: 96/38553 - the isoiatiOT °< * — — ~1- 

a e inserted : T" " 

for Isp ay 0 " : 9 " POlyPePtld " ^ctioning as scaffolds 

wouid di ;:::r b : 2:^:™^:—- suoh scaffoids 

taroet „,„,=„.„. 9h " a " ln "y interaction with the 



Nothing is , however, mentioned about th. us. of scaffdds de- 
rived from naturally occurring pro tein inhibitors of enzymes 
Inhibition of enzymatic activity by such inhibit en2 ^ es - 
- the simple binding of a t.rglt IrlT^T^".^ 
suggested in the c.UScreen technology - ls . rau ch m e « " 
=ient way to affect intracellular biochemical events. 



DETAILED DISCLOSURE OF THE INVENTION 

25 Def ini t i or> g 



in the following, a number of teriTls will be defined f 
purposes of the present disclosure: 
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A 



•modulator" is in the present context a biomolecule which 
when expressed in vivo, to effect the activity of another biomo- 
lecule in the cell. Thus, the modulator in essence can inhibit 
or enhance the activity of the biomolecule. Furthermore, the 
5 modulator can interact directly with the biomolecule, but the 
effect might as well be indirect (also known as transdominant ) , 
i.e. the activity change of the biomolecule is brought about by 
changes in the cell's biochemical machinery, changes which are 
ultimately the result of the presence of the expression product 
10 of the randomized nucleic acid sequence. 

A "randomly modified nucleotide sequence" is a nucleotide se- 
quence which in a number of positions has been subjected to 
insertion or substitution by nucleotides, the nature of which 
cannot be predicted. In many cases the random nucleotides or 
15 nucleotide sequences inserted will be "completely random" (e g 
as a consequence of randomized synthesis or PCR-mediated 
mutagenesis). However, as will appear from the disclosure below 
the random sequences can also include sequences which have a 
common functional feature (e.g. reactivity with a ligand of the 
20 expression product, or the random sequences can be random in the 
sense that the ultimate expression product is of completely 
random sequence with e.g. an even distribution of the different 
amino acids. 



"Substantially identical cells" is a term herein intended to 
25 designate cells which all exhibit a specific phenotypic trait in 
such a manner that a change in the expression of said trait in 
one cell due to an interaction effected by the introduction of 
random nucleotides according to the inventive methods would also 
occur in one of the other substantially identical cells had 
30 these been transformed with the same vector. In other words, the 
important parameter to assess when choosing substantially iden- 
tical cells m the inventive methods is whether an observed 
change in one cell's exhibition of the phenotypic trait can be 
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taken as an indication that any other cell in rh 
would have behaved th» Population 

Same WaV 33 3 consequence of the same 

i:;/:ri «rr tiaiiy im - - 

■ 5 or a t lssue : ul t: e llne " «" ^ of a cell e uUu re 

A "phenotypic trait" i s the n h«™=ki 

=o mP o Sition in . cell (g n h : ty ° p b : rv b e - suit °< * — 

,^ genotype; , i.e. a property of a ppIi 

suitable means) which depends on the presence of „„. 
10 and the expression rate thereof n T. \ll 

tr.« can he ,„ y of a number of different J 

j-xganas, cell surv v 3 i 

- a ^/ presence or absence of 
expression rate, etc. antigen, 

«~ n to 100 iL:° a d :; d ;; r rrr ou — ■* 

than inn 3 • • residues, and polypeptides of more 

than 100 ammo acid residues. Furthermore, the term is al 
intended to include proteins i „ * ■ ° 
on ^ • • Proteins, i.e. functional biomolecules 

20 comprising at least one polypeptide- whpn • SCUleS 

two polypeptides th. ^P^e, when comprising at least 

*ed or! ^ f ° rm C ° m P le - s r be covalently lin- 

ked, or may be non-covalently linked. The polypeptide (s) in a 
Protein can be glycosylated and/or linidaJrf V ^ tlde(s) ln a 
Prosthetic groups. ^P^ated and/or comprise 

25 vivo" is herein mean to designate th. 

t- ■ ■ u "ignate the environmental condi 

tions inside livinq cells h' « n condi- 
„ „• U - e ' Cells wnic h are metabolicallv 

active and can maintain their vit^ * . • Uy 

• , ^ ept in c U lt U re or J y Z^^L^a^ 

cells a. t SrS t0 ln vitro culturing of 
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'Transformation' 



ripri h A Pr ° CeSS by Which genetic material car- 
ried by an individual cell i«s ^ u ■ 

ceil is altered by incorporation of M n 
genous DNA into its genome. 

"Transfection": The unt-^P ^ 

uptake, incorporation, and expression of 
5 recombinant DNA by cells. expression of 

"Transduction": The tran^fpr 

cell to anoth k ° f genetlc information from one 

cell to another by way of a viral vector. 

The term "effective part" when used in the context of a 
peptide, or ribonucleic acid is in th* Protein, 
10 to mean a part (e g a su l COnteXt intend ^ 

protein a less h SUbSSgUenCe in the case of an n-meric 

Isired fu nct an_n " meriC m ° leCUle ' WhiCh h " "gained the 

protein fUnCtl ° nallty ° f the -tive molecule from which the 
protean or peptxd. is derived. Fo r instance, in the case of 

15 E I'":" V trUnCated - — cule described in 

3 example 1 seems to be neces^n, 

. . necessary to ensure expression of the 

In general, the disclosures in WO 96/3fls^ ^ 
20 relating 96/38553 and WO 97/27212 



25 



30 



to preparation of randomly se qoences , to the choice 

the r dom sequan=es _ ^ composition > of 
tar et sequences _ tQ ^ ^ 

" L I ! h ' fled ' " Ch0i " ° f —ion ra ethod. 

to the me thod s of introducing, the nucleic acids into the 
elevant celi t yp e, to the choice of (retroviral vec 
"here appUcaMe, , to the m ethod of pro d uci „ g the vectors 
the cno.ce of promot « 5 . to t „. ^ ^ 
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where WiaM.,, to th . methods Qf concentra 

0 ;i"" 0 " S f ™ "» """"i- «». <»nere .pp llc . ble) . 

to the cho.ee of substantially identical cells tQ use >' 
_ the method , to the type of phenotypic 

the - nner ln „ h ich the change is detected, to methods ' of ° 

o „7s Phen ° tyPi ""* "lis. to the isola- 

tion and sequence determination of the randomly edified 
seances, to the isolation and characteri 2 ation of the 

It " T- rand0ml2ed >° ™ing .ethods. 

and to the choice of applications of the methods 

The^Ze" 1 ;; 3 : 6 f r the purposes ° f the 

tnese references are focussed on the use of th~ 
-r c • , Ullt: use or the qeneral nr-in- 

15 ciple in higher eukaryotic cells th c 

= 1 . Y ceils, the present disclosure will 

- ot h ic s : ™~z^j^r r — of a 

T: o, the present JentT Is U^est ^le 

of the methods in prokaryotic systems. 

It is normally preferred that the transformed cells which are 
being examined i n step (d) predominantly carries (and 
25 one single codv of hh a . carries (and expresses) 

gie copy of the vector. By ensuring this, the interpret* 
tion of a change in phenotype of the cell, "terpreta- 
t-^u u cells becomes a much easier 

tas k , „hereas the interpretation of a phenotype change in cells 
expressed .ore than one single rando.i.ed seguenee renders 

To ensure that predominantly one vector h»« t- 

th= «n vector has transformed each of 

the cells examined it is ^ r, f«„ -w, , 

3 e>g - feasi ble that the transformation 

M&E Bioiech/CetlScrtertf DK UVK/ 1 998-07-20 
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step (b) is performed under such conditions that the cells 
transformed are predominantly or at most transformed with one 
single vector from said pool (this can e.g. be achieved by 
adjusting the concentration of infectious virions in embodiments 
of the invention where the transformation is obtained by means 
of transduction,, or wherein, p rior to carrying out step <d) 
cells being transformed with more than one vector from said pool 
are substantially excluded from the further steps . This latter 
option requires that it is possible to quantify the number nf 
- transforming vectors and this can be achieved by including a" 
detectable marker in the expression product, e.g. a flourescent 
Probe. Another option is to rescreen cells which exhibit changes 
in phenotype, thereby ascertaining whether more than one vector 
nas transformed the cell. 

15 It will be understood that the molecule chosen for the purpose 
of being a scaffold, and wherein the random sequences are ulti- 
mately introduced, can be either a peptide sequence or a nucleic 
" ld Se ^ uence ' ""eh as an RNA fragment interfering (in an 
antisense-manner, with mRNA, tRNA or with a ribozyme. Alterna- 
20 txvely, such an RNA fragment could exhibit ribozyme activity 
itself, thereby having an indirect influence on the expression 
rate of other enzymes. At any rate, the resulting product, i. e 
the randomized expression product, can be a peptide or a ribonu- 
cleic acid such as a ribozyme. 

25 One important feature of the scaffold is, as mentioned above, 
that it is stable towards proteolytic attack and/or is insensi- 
tive to a reducing environment, such as the one which is found 
intracellularly. 

in preferred embodiments of the invention the random nucleotides 
■30 are introduced in part(s) of the parent nucleotide sequence 
which encode(s) the active site(s) of the parent peptide or 
Parent ribonucleic acid, or the part(s) which encode (s) struc- 

MAE BiotecVCefIScreea2 DK (,'PIC'l«g.07-20 
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ture( S ) interfering with the active site (a,. As discussed above, 
the act.ve site (as well as other exposed structures of the 
scaffold) need to be stable presented to the environment in 
order to be able to interact with other biomolecules . Hence 
preferably the in Variable part of the nucleotide ^ ^ 

codes truncated parts of the parent peptide or parent ribonu- 
cleic acid sufficient to maintain stability of the randomized 
product. 



in some embodiments it is preferred that the invariable part of 
10 the parent nucleotide sequence encodes a peptide which is free 
from disulfide bridges. This is, as mentioned in Example 2 
below, due to the fact that disulfide bridges are not formed in 
the nucleus or in the cytosol. Hence, in cases were the scaffold 
must be in a functional state when present in the nucleus or the 
15 cytosol, it would normally be preferable to use a scaffold which 
does not contain disulphide bridges or which do not rely on 
these in order to maintain stability and functionality. On the 
other hand, in embodiments where it is desired that the random- 
ized expression product is confined to the ER, or to another 
20 compartment allowing for the presence of stable disulphide 
bridges, before the randomized sequence is presented to the 
environment in a satisfactory manner, it is of course desirable 
that the invariable part of the parent nucleotide sequence 
encodes a peptide having disulphide bridges, because the chances 
25 of having a correctly folded and functional scaffold outside 
such compartments is relatively small. 

It will be understood that the random nucleotides are preferably 
introduced in the form of an insertion or a substitution into 
• the parent nucleotide sequence, optionally in combination with 
30 deletion^, in the parent nucleotide sequence. Deleted sequences 
" P ol >Wide could e.g. be parts of an active site 

the presence of which in unaltered form is toxic or otherwise ' 
deleterious to the transformed cells. 
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The number of random nucleotides induced can vary to a great 
9tee "°™ ally ^ 1- between 3 and about 

;vr ic 15 preferred thac at least 5 ' such as « ^ 

5 he o : ^ " lea " ~ - ^reduces. On 

r: ha d nd ' it is preferred that •< — 9o - » « —t 

70 or 80 random nucleotides are introduced. The most preferred 
I*" ! f """"""d randomized nucleotides varies between 15- 
60 preferably 20-55 or 2 5-50 nucleotides. EspeciaUy preferred 
numbers of random nucleotides are 18. 19. 20. „ „ L " 
10 25. 26, 27 , 28 , 29 , 30> 31f 33 _ 35 , asWV^O 

; 8 2, 44 ; ,s - »■ »■ «: s «: 11; : 

s'/ ^8, 59, and 60 nucleotides. 

The random nucleotides are introduced in the scaffold in the 

15 „ 7 T Cle0 " de Se<J ~ - the for™ of single random 

15 cleotrdes introduced at specific site5 in the parent nucleo- 
tide sequence. A variation is to substitute a part of the scaf- 

° SSqUenCe " Uh 3 -hich retains parts of the scaf- 

fold sequence (e.g. those which are essential for stabi- 
lity/functionality, but where other parts are randomized. 

20 The random nucleotides are preferably selected from the group 
consisting of ^ 

- synthetic, completely random deoxyribonucleotides - 

synthetic random DNA sequences, wherein limitation on 
randomization of some nucleotides is introduced so as to 
l.mit the number of available sequences and/or to avoid 
undesired stop codons and/or to facilitate introduction of 
post-translational modifications of expressed peptide (a)- 
synthetic random DNA sequences as in (1) or (2) coupled ^o 
a sequence encoding a purification tag; and 
30 - CDR encoding nucleotide sequences isolated from a li brary 
of immune-competent cells raised against an antigen (in 
this embodiment it is preferred that CDR encoding nucleo- 
tide sequences encode CDR-3 peptide sequences) . 
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The l«ter type of "randomization" actually introduces a re- 
striction on randomness which ensures that the sequences intro- 
duced encodes an antigen recognizing region. !t is, however, 
well known that a polyclonal immune response against an antigen 
5 consist of a large number of immune competent cells which ali 
react with the same antigen ,or perhaps even with the same 
eprtope, out the correlation between amino acid sciences of the 

zv:i: ecognition of the epuope,s) is 



An alternative way to introduce limitations on the randomness of 
the nucleic acid sequences which are ultimately tested in the 
substantially identical cells is the foliowing: Upon preparation 
Of the vectors, they are used in a 1- round of phage display 
where the phages transformed with the vectors are panned aganst 
15 a Ibrary containing a ligand of choice. as for the technique o 
employing CD* encoding sequences, the resuXt is that the se 
quences which are ultimately tested in the substantially identi- 
cal eel s are "unpredictable- ( and thereby random, but neverthe- 
less selected on the b^i^ * -r 
20 lack of kn functional feature. Again, the 

lack of known correlation between nucleic acid sequences and the 

Prod e u"t tl0 d three - dl — ••»=. ^tween the expression 
product and a ligand of choice has the consequense that the 
tested subgroup of sequences still is randomized. 

25 of t a he SPeCla \~-"' °' th. above-technique where the method 
of the invention is combined with phage display, both test 
systems are repeated in an alternating manner, that is a shuffl- 
ing between intracellular expression in the substantial identi- 
cal cells and panning of a phage library. 

30 the"^ " 0btai " " C ° ntrOUed ^"ribution of amino acids in 
the randomized peptides, when the modulator is a peptide, it is 
practice! that the random nucleotides are prepared by codon 
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spUt synthesis where defi „ ed DNfl codQns a ^ „ 

0rder ' 9 Ascription of this pri „ clple I'" 3 

- HO 96/38553 . The preferred embodiment in this 

- re, the relative amount of synthesized codons ensure ^ 

: it amino acids - ui be ™ c 

same frequency in encoded polypeptide fragments. 

:::: r ^z the randomized f —= * h . 

ectors, it is according to the invention preferred fhJ , ... 
random nucleotides are introduced into the expression"™ by 

eve/":;" °' "^"^ — — mutagenesis. How- 

ever, other optrons are k »„,„ to the skilled person , and it is 

possioie to insert synthetic random sequence libraries nto 

the vectors as well. into 

Apart from having the randomized fragment of the expression 
15 product introduced into a scaffr>iH ■ Passion 

,. . scaffold m accordance with the ore- 

sent invention, it is often necessary to couple the random 
«quence to a fusion partner by having the randomized nucleotide 
sequence fused to a nucieotide sequence encoding at ieast one 
fusion partner. Such a fusion partner can e.g. faciiitate „! 

zation of the expression product. 

a F °purmc PU t rPOSe3 ^ PUr " 1 ~"»' th * partner can include 

tat, e e " " t "' BSP ^otinyiation 

target sequence, of BirA, fl u tag, l, cz , and GST . Furthern]ore 

« the fusion partner may inclu de a sorting signai or a targeting 
sequence, cf. the discussions below. "geting 

in embodiments where the modulator is itself a modulator of 

enzyme activity, it is in theory possibie to effect both the K 

and/or the V„„ of the relevant enzvme a „h, * • ■ 

30 in less effectivity of the relevant dUCtl ° n " K » 

" relevant enzyme insofar that an 

-creased substrate concentration is required to obtain so* of 
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maximum activity of the enzvmp nn • 

enzyme. An increase of K... has the oodo- 
sate effect, of course ln ,„, " pp ° 

fects V h „ interference wrth an enzyme which ef- 

act vit7of th" * C ° nSeqUenCe th " tte ""l— Possible rate of 

5 dec d t e " Zyme 15 lnCreaSed '" hen - " -=-ased, or 

S decreased .when ,„„ ls increased, . ftt a„ y rate, the modula tor of 
the enzyme activity win g ive tne phen o t y pic impression ^ ° f 
enzyme activity has either been inhibited or stimulated. is 
preferred ln this embodiment that the modulator is an inhibitor. 

When the method of the invention has finally lead to th. identi- 
10 f at™ of a modulator or of a target therefor, it is preferred 
at the 3-dimensional structure of the identified mo LZ ^ 
resolved, srnce this allows for the implementation of rational 
drug screening and computer drug modelling methods. 

Use of the inventivo m . H , ods in _ . _ _ 

15 The originally envisaged technology disclosed in wo 96/38553 
focussed on screening for interactions in eu.aryotic cells 
However, the technology is also applicable in pro.aryotic sys- 



20 an ™! ' " " P ""« invention win 

HlZ 1 * nti """™ °< «'-«o unknown interactions i 

pathogeny bacteria, interactions which will be useful in the 

ZTan deV r° Pin9 — the inventive me 

thods allows for the identification of both novel ligand pep- 

25 r T f ^^^^ » «" - of the target molecu es 

sary tool f 93 ln """'"« " -™ the neces- 

sary tools for instigating computer drug modelling and for 

' tZItTh 9 "r 1 " 0 - 1 d ™« ™^ once such ligands and/or 
targets have been identified/isolated. 

30 e H ffe e c7' TV"" °' lden "^ln, antibacterial 

effects and substances, the method also opens up for improve- 
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-ents in industrial fermentation processes. In such cases it 
wall e.g. be possible to identify biomolecules which are impor- 
tant in the biochemical pathways in lactic acid bacteria and 
thereby provide tools for the production of new dairy products 
such as cheese, yoghourt, and other products of lactic acid 
bacterial fermentation. 

Somewhat related to this approach is the use of the methods of 
the present invention in screening performed on banter,,, „„,_ 
tures used in purification processes. It is weli-knowr/rn the 
art of e.g. waste water purification that the microbiological 
cultures (activated sludge, which conduct the degradation of 
organic material, are relatively vulnerable vis a vis changes in 
the environment and therefore the provision of more robust 
strains of bacteria would be one way to improve such systems. 
Alternatively, the method of the present invention would also 
allow for the identification/isolation of Uganda and targets in 
such bacterr, which, when interacted with, can lead to e g 
increased efficacy in degradation of specific organic or'inor- 
game substrates. 

20 As win be appreciated from the above, the present invention 
therefore is highly useful in prokaryotic systems. 

For the purposes of using the method of the invention in 
Prokaryotic cells, it is preferred that the prokaryotic cells 
are bacteria selected from the group consisting of Bacillus spp . 
25 e.g. B. anthracis, fl . subtiiis ^ fl ™ 

(e.g. c. Botulinum, C. difficile* r ■ 

airriciie, C. perfrmgens, and C. teta- 
ny. Corynebacterium s PP . (e.g. c. diphtheriae, and c. pyoge- 
nes), Staphylococcus spp. {e .g. s . aureus and s albicans) 
Streptococcus spp. (e.g. s. pneumoniae, S. pyogenes, and s. 
eaalactiae), Escherichia coli, Serratia marcescens, Klebsiella 
SPP. (e.g. K . pneumoniae), Proteus spp. (e.g. p. mirabilis) , 
dtrobacter spp. (e.g. Citrobacter freundii) , Salmonella spp. 
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(e.g. S. typhi, s. typhimurium, s. shottmulleri and S. paraty- 
phi), Shigella spp. (e.g. s. dystenteriae, S. flexneri, s 
boydii, and s , sonnei)f Pseudomonas spp _ (e g p aeruginQsat p 
pseudomallei, and P. MlIei) , Acinetobacter spp , ' 
spp., Plesiomonas s PP ., yersinia s PP . (e.g. y. pestis Y 
enterocolitica, and y. p S e Ud o tu , ercuJosis) , f „„ciseIIa tuiaren- 
sis. Vibrio spp. (e.g. v. choleras and V. paralyticus) , 
Campylobacter spp . (e . g . c . and Q ^ 

10 and *. aegypt.us), 5ordeteiia spp , {eg _ fi pertussiSf fl 

parapertussis, and B . bronchiseptica) , Brucella spp., ^ sseria 
SPP. (e.g. N . gonnorhoeae and N . meningitidis, Treponema palli- 
dum, Leptospira interrogans; Borrelia spp. (e . g . B . burgdorferi 

sensu stricto, B. garinii, B . afzelii ,nH n 

dxzeiii, and S. recurrentis), 
lb Legionella pneumophila. Li^r^r-i* _ 

P ^-ta, listeria monocytogenes, Mycobacterium 
SPP. (e g. *. tubercuJofliS/ „. ^ africanumi 

and Af. ieprae) , Treponema paiiio^, Chlamydia trachomatis, 
Actinomyces spp., Rick ettsia spp., and Mycoplasma spp. (e.g. * 
pneumoniae) . y 

20 This list of bacteria th u s entail, bacterial families and spe- 
cies whrch at. involved in pathology of a large number of dis- 
eases in humans. Of these, £ . coll and B . subtnis ^ 
for industrial fermentation; this is also the case for lactic 

25 a Tt h baC T ia ' n ° tably =PP- and L.ctob.clllu. spp. 

and therefore it is also preferred that the inventive methods, 
when employed in bacterial svstem^ « r 

syscems ' are Performed on such non- 
pathogenic species. 

When using the inventive methods in a prokaryotic system, it is 
very often interesting to identify those cells which are im- 
paired in growth or lethally damaged due to the presence of the 
heterologous expression product introduced according to the 
mention. This, however, is not completely unproblematic 



since 
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the main phenotypic trait associated with e.g. bacterxal death 

is absence of the bacterium. 



It is therefore necessary to device an experimental setup which 
wxll allow identification of cells transformed so as to be less 
5 good survivors than other otherwise corresponding cells One 
advantage is that expression of the heterologous genetic mate- 
rxal is under the control of an inducible promoter. In this way 
it is possible to expand colonies of cells which b — 

transformed with genetic material which, when expressed"^ 
10 lethal or growth-impairing to the cells. After that, the expres- 
sxon can be switched on and careful examination of expanded 
colonies should reveal those clonal colonies which do not follow 
the same growth pattern as e.g. an untransf ormed control. 

One method of doing this is to spread transformed cells on 
15 plates with growth medium and allow the cells to grow up to a 
pre-determined average size. The spreading of cells should be 
such that the visible colonies forming will generally be com- 
posed on one single clone of cells. When the pre-determined 
size of colonies have been reached, all plates are blotted to a 
20 carrier medium so as to prepare a "negative" of each agar plate 
After this, the expression of the inserted random sequences is 
induced and the colonies are allowed to grow again. The plates 
are examined continuously or at suitable intervals (e.g by 
means of digital image processing systems well known to the 
25 skilled person). Those colonies which reveal an impaired or 
arrested growth compared to the remainder of the colonies or 
compared to controls are thereafter identified, since the growth 
pattern of each colony in an automated manner can be followed 
. These colonies can then be identified and isolated from the 
30 "negative" blot and it is thereafter a relatively simple proce- 
dure to extract the transforming genetic material and determine 
the sequence thereof. 
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In this context, one interesting option is to render antibiotic- 
resxstant bacteria non-resistant. The growth medium either 
contains, or is during culturing enriched with, the antibiotic 
in question and the colonies which upon induction of exnr.ssion 
5 can be demonstrated to be less drug resistant than control', axe 
examined further. Also pure bactericidal/bacteriostatic effects 
can be examined, m SUC h an embodiment, the bacteria are e a 
cultured on a suitable growth medium. Those colonies which 'after 
induction of expression shows evidence of reduced or arrested 
growth are examined further: It is expected that some of the 
bacterial cells will be demonstrated to carry genetic material 
encoding an expression product which interacts with novel (or 
known) targets for antibacterial agents. 

Another phenotypic trait of interest is of course superior 
15 sur V i val of cells, it is, when dealing with utilisable bacteria 
of interest to identify targets which will increase the 
rate of the bacteria. Per example, bacteria used in industrial 
fermentations normally can be lethally damaged as a consequence 
of their own uncontrolled production of heterologous expression 
20 products. If genes or target molecules can be identified which 
have a positive effect on the survival of such bacteria, the 
economic potential is enormous, since a fermentation process 
will be rendered more economic (less need to startup of new 
fermentations). Similarly, bacteria used in e.g. waste water 
purification can be made more resistant against toxic agents i„ 
their environment. 



30 



The experimental setup in this context is relatively simple: Tne 
transformed bacteria are simply subjected to the potentially 
lethal condition, and only colonies which exhibit a superior 
survxval are isolated and examined (and that will typically be 
the colonies which are detectable). A setup li ke the above- 
described for identifying cell death should thus not be neces- 
sary . 
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t I', 9r ° UP ° f Phe "° t ^ i = '"its to be examined are 

those wh.ch can be detected by e.g. biochemical or immunological 
means. It is expected that the method „f the invention will 
.now for identification of systems in bacterial cells which 
when properly modulated, can render the bacteria more effective 

clld r °e ln h indUStrial such phenotypic traits 

could e.g. be changes In enzyme activity, changes in receptor 
density, changes in expression rate etc. 

Sp.ci.1 considerations apply „ he „ the randomized expression 
10 product is fused to a fusion partner which decides the final 

locate of the expression product. Signal sequences in ptoka „. 
es are well- k nown in the art, but it should briefly be .en- " 

P s°;: b le r — * - k „ow„, and it is also 

possxble to export the expression product to the periplasm 
15 space of bacteria. Finally , it is also possible ^ J 

secretion signals S o as to allow the isolation of the expression 
Product fro, culture supernatant. However, in many cases it r 
of course most relevant to k eep the expression product Inside 
the prokaryotic cytoplasm. 

20 U- Se °f the method i„ ...^ r vorif . . Y p ^.„„ 

It is especially preferred that the Inventive method utilises 
eu.ar c cells as the substantially identical cells in order 
to alio w screening for active biomolecules . Hence, these eu k ary- 

25 a d T CeUS ' «»<*<»<>■» -Us. animal cells, 

25 and plant cells. 



15 the case for bacteria, a number of fungi are pathogen, in 
-mmals. and therefore the present technology will. in p Lnel 
-ith what has been described above concerning antibacterial 

30 pa""' 5 ' ^ r 6 ^ 1 ide " tifyi ^ antifunga! agents by using 

30 pathogeny fungi as the substantial identical cells in the 
method. Furthermore, fungi Especially yeast strains,, i ike 
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bacteria, are also utilised in fermentation processes (e.g in 
the wine and brewing industries), and the method can therefore 
also be utilised using such non-pathogenic fungi as the substan- 
tially identical cells which are transformed with the vectors 
5 whereby improvements in these strains can be obtained. 

Preferred examples of fungi serving as the eukaryotic cell in 
the inventive methods are Epidermophyton spp., Trichophyton 
spp., Microsporum spp., Candida albicans, Philophora spp., 
Coccidioides immitis, Histoplasma capsulatum, Blastomyces 
10 der/natitidis, Paracoccidioides brasiliensis, Cryptococcus 

neoformans, Asperaillw; «?nn <? = ^~k 

, ^ctyiijus spp., Saccharomyces cerevisiae, 

Klyveromyces lactis, and Piccia pastoris. 

Unlike fungal cells, protozoan cells are only relevant as patho- 
gens for humans and other mammals, although some protozoans form 
15 part of biocultures conducting biological waste water purifica- 
tion. The method of the invention is therefore contemplated to 
be useful in identifying new targets for antiprotozoan agents 
In this context, the preferred protozoan cells used as the 
substantially identical cell in the methods of the invention are 
20 selected from the group consisting of Giardia lamblia, Trichomo- 
nas vaginalis, Dientamoeba fragilis, Trypanosoma spp., Leishma- 
ma spp., Entamoeba histolytica, Naegleria fowleri, Acanthamoeba 
castellani, Harmanella spp., Isospora belli, Cryptosporidium 
spp., Sarcocystis spp., Toxoplasma gondii, Plasmodium spp. (e g 
25 P. falciparum, P. vivax, P. malariae, P. knowlesi, and P. ova- 
le), Babesia spp., and Balantidium coli. 

Also plant cells are according to the invention interesting as 
target eukaryotic cells. The plant cells can be any plant cell 
which can be subjected to genetic engineering techniques allow- 
ing for single cell expression and growth. Thus, cells derived 
from e.g. Nicotiana tabacum (tobacco plant), Arabidopsis thali- 
ana, Brassica napus, Brassica juncea, Musa sp. (banana plants) 



30 
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rice, and corn are examples of plant cells useful in the inven- 
tion. The skilled person in the art of plant genetic engineering 
will know to choose suitable plant cells in the appropriate 
stage of their life cycle, suitable vector systems as well as 
5 suitable transformation methods. A short summary is given here: 

As other organisms, plant cells can be transformed with 
foreign DNA. One strategy for plant transformation employs 
Agrobacterium tumefaciens, a natural lw r,^,,r-v,- 
pathogenic bacterium which contains a plasmid (the Ti 
plasmid) with the ability to enter plant cells and insert a 
portion of its genome into plant chromosomes. The Ti plas- 
mid has been engineered to make it a vector for plant 
transformation by including sequences for replication in E. 
coli and Agrobacterium, unique restriction sites for in- 
15 serting foreign genes, and selectable markers. 

Unfortunately, Agrobacterium is not very effective at 
transforming monocots, a large group of plants that in- 
cludes all of the agriculturally important cereals. Le- 
gumes, another important group of food crops, are also 
difficult to transform and regenerate with Agrobacterium. 
Therefore, a second strategy for transforming plants has 
been developed involving the Gene Gun, where a gene is 
inserted into an expression vector and coated onto beads. 
The DNA-coated beads are then introduced by means of the 
25 gene gun into plant cells, where a small fraction are taken 

up and incorporated into the DNA. Individual plant cells, 
callus, regenerating shoots, and embryos are all suitable 
targets in this technique. 



To determine whether cells have actually incorporated 
foreign DNA and become transgenic, reporter genes such as 
GUS and Luciferase genes are used. In any case, foreign 
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genes must be flanked by a plant promoter in order to be 
expressed . 



It is preferred to use cells derived from animals. These cells 
can be mammalian cells, arthropod cells such as insect cells 
5 avian cells, and piscine cells. A number of reasons can be 

listed for using such cell types which each require a relevant 
choice of transformation and expression systems, growth condi- 
tions, etc, all easily determined and chosen by the skilled 
person. It suffices to note that e.g. certain insects cause 
10 enormous problems in human society (due to their direct damaging 
activities or due to their functions as vectors carrying infec- 
tious agents), and therefore the method of the invention would 
supplement in the attempts of controlling such insects. As for 
the mammals, birds and fish, a number of these are important in 
15 agri- and aquaculture, where disease control is of interest. 

According to the invention mammalian cells such as human 
cells/cell clones or human cell lines are most preferred. This 
is due to the fact that a large number of diseases in humans and 
other mammals etiologically depend on molecular interactions in 

20 the living cell - the provision of drugs or lead compounds which 
interact in vivo with biomolecules which play a role in diseases 
is therefore of great interest. Preferred mammalian cells are 
Chinese hamster ovary (CHO) cells, VERO cells, HeLa cells, W138 
cells, BHK cells, COS-7 293 cells, and MDCK cells, which are all 

25 well-known in the art. 

The vector is preferably selected from the group consisting of a 
retroviral vector, a vaccinia virus vector, an adenoviral vec- 
tor, an adeno associated virus (AAV) vector, a herpes simplex 
virus (HSV) vector, an alpha virus vector, and a semliki forest 
30 virus vector. 
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The candidate nucleic acids are hence introduced into eukaryotic 
cells as part of a vector to screen for modulators of target 
enzyme activity. By the term "introduced into" is herein meant 
that the nucleic acids enter the cells in a manner suitable for 
5 subsequent expression of the nucleic acid. The method of intro- 
duction is largely dictated by the targeted cell type, cf 
below. Exemplary, but non-limiting methods include CaPO, preci- 
pitation, liposome fusion, lipofectin®, electroporation, viral 
infection, etc. 



10 



The randomly modified nucleic acids are preferably integrated 
into the host cell genome (e.g. by means of retroviral infection 
of the host cell), or may exist either transiently or stably in 
the cytoplasm (i.e. through the use of traditional plasmids 
utilizing standard regulatory sequences, selection markers 
15 etc.). as many pharmaceutical^ important screens require human 
or model mammalian cell targets, retroviral vectors capable of 
transfecting such targets are preferred. 

In a preferred embodiment, the candidate nucleic acids are part 
of a retroviral virion which infects the cells. Generally 
20 infection of the cells is straightforward with the application 
of the infection-enhancing reagent polybene. Infection can be 
optimized such that the cells predominantly express a single 
construct each, e.g. by using tne ratio Qf virus particles tQ 
the number of cells. Alternatively, it is possible to "screen 
25 out" cells which have been infected with more than one single 
virion, e.g. by quantitatively assessing a selection marker and 
only. The rate of infection is well-known to follow a Poisson 
distribution . 

^ A preferred embodiment of the invention where the substantially 
30 identical cells are eukaryotic thus comprises that step (a) is 
carried out by 
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(i) 



transfecting suitable packaging cells with vectors com- 
prising the randomly modified nucleotide sequences and 
which are integratable in virions produced by said packag- 
ing cells, 

5 (ii) culturing said transfected packaging cells in a culture 
medium under conditions which facilitates production by 
the packaging cells of virions containing the randomly 
modified nucleotide sequences, 
(iii) recovering and optionally concentrating said virions, and 
transducing said substantially identical cells with the 
virions . 



10 (iv) 



Thus, the candidate nucleic acids are introduced into the cells 
using retroviral vectors. Currently, the most efficient gene 
transfer methodologies harness the capacity of engineered vi- 
15 ruses, such as retroviruses, to bypass natural cellular barriers 
to exogenous nucleic acid uptake. The use is well-known in the 
art of helper-defective packaging cell-lines which are capable 
of producing all necessary trans proteins (gag, pol, and env) 
required for packaging, processing, reverse transcription, and 
20 integration of recombinant genomes. Those RNA molecules which 
have in cis a * packaging signal are packaged into maturing 
virions. In eukaryotes, retroviruses are preferred for a number 
of reasons. First, their derivation is fairly easy. Second, 
unlike Adenovirus-mediated gene delivery, expression from retro- 
25 viruses is long-term (adenoviruses do not integrate) . Adeno- 
associated viruses have limited space for genes and regulatory 
units and there is some controversy as to their ability to 
integrate. Retroviruses therefore currently provide the best 
compromise in terms of long-term expression, genomic flexibility 
30 and stable integration, among other features. The main advantage 
of retroviruses is that their integration into the host genome 
allows for their stable transmission through cell division. This 
ensures that in cell types which undergo multiple independent 
maturation steps, such as hematopoietic cell progression, the 
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retrovirus construct will remain resident and continue to ex- 
press . 

Preferred retroviral vectors are derived from retrovirus se- 
lected from the group consisting of Avian Leukosis-Sarcoma Virus 
5 (ASLV), Mammalian type C, Mammalian type B, and Lentivirus They 
may all optionally be modified with heterologous cis-acting 
elements . 



It is preferred that the retroviral vector has non-identical 
ends so as to facilitate PCR-based generation of random DNA 
10 sequences. It is furthermore preferred that these non-identical 
ends contain non-identical promoters. 

An especially preferred retroviral vector contains a heterolo- 
gous promoter replacing the viral promoter in the S'-LTR, such 
as a CMV promoter, an RSV promoter, an SV-40 promoter, a TK 
15 promoter, an MT promoter, or an inducible system such as Tet or 
Ecdysone. A suitable vector is described in WO 96/38553, cf. 
Fig. 1 and accompanying text. 

Particularly well suited retroviral transaction systems are 
PE501 (US 4,861,719), Bosc23 (WO 94/19478), Psi2 (R . Mulligan/D 
20 Baltimore), GP+E86 (US 5,278,056), PhoenixEco (WO 97/27212) 
PA317 (US 4,861,719), GP + AM12 (US 5,278,056), DA(a mp ho) 
(WO 95/10601, WO 92/05266), Bing (WO 94/19478), FLYA13 
(WO 97/08330), ProPak ( available from SySternix) # CRIp (R> Mull ._ 
gan), PsiAM (R. Mulligan/D. Baltimore), Phoenix-Ampho 
25 (WO 97/27212), PG13 (Targeted Genetics), H9 (293GPG) (D. Ory, M 
Sadelain, R. Mulligan, J. Schaffer) . 

in one embodiment of the present invention, the library is 
generated in a retrovirus DNA construct backbone, as is gene- 
rally described in e.g. W0 97/27212. Standard oligonucleotide 
30 synthesis is performed to generate the random portion of the 
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candidate modulator, using techniques well known in the art (cf. 
Eckstein, Oligonucleotides and Analogues, A Practical Approach, 
IRL Press At Oxford University Press, 1991); libraries may be 
commercially purchased. Libraries with up to 10' unique sequen- 
5 ces can be readily generated in such DNA backbones. After gene- 
ration of the DNA library, the library is cloned into a first 
primer. The first primer serves as a "cassette" which is inser- 
ted into the retroviral construct. The first primer generally 
contains a number of elements, including for example, the 
10 required regulatory sequences (e.g. translation, transcription, 
promoters, etc), fusion partners, restriction endonuclease 
(cloning and subcloning) sites, stop codons (preferably in all 
three reading frames, regions of complementarity for second 
strand priming (preferably at the end of the stop codon region 
15 as minor deletions or insertions may occur in the random re- 
gion) , etc. 

A second primer is then added, which generally consists of some 
or all of the complementarity region to prime the first primer 
and optional necessary sequences for a second unique restriction 
20 site for subcloning. DNA polymerase is added to make double- 
stranded oligonucleotides. The double-stranded oligonucleotides 
are cleaved with the appropriate subcloning restriction 
endonucleases and subcloned into the target retroviral vectors, 
described below. 

25 Any number of suitable retroviral vectors may be used. Gene- 
rally, the retroviral vectors may include: selectable marker 
genes under the control of internal ribosome entry sites (IRES), 
which allows for bicistronic operons and thus greatly facili- 
tates the selection of cells expressing peptides at uniformly 

30 high levels; and promoters driving expression of a second gene, 
placed in sense or anti-sense relative to the 5--LTR (long 
terminal repeat). Suitable selection genes included, but are not 
limited to, neomycin, blastocidin, bleomycin, puromycin, and 
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hygromycin resistance genes, as well as sel f -fluorescent markers 
such as green fluorescent protein, enzymatic markers such as 
lacZ f and surface proteins such as CD8 , etc. 



Preferred vectors include vectors derived from retrovirus se- 
5 lected from the group consisting of Avian Leukos is-Sarcoma Virus 
(ASLV) , Mammalian type C, Mammalian type B, and Lentivirus as 
well as vectors derived from MSCV (murine stem cell virus), 
modified MFG virus ^nd pBABE, and optionally modified with 
heterologous cis-acting elements. 

10 The retroviruses may include inducible constitutive promoters. 
For example, there are situations wherein it is necessary to 
induce peptide expression only during certain phases of the 
selection process. For instance, a scheme to provide pro-inflam- 
matory cytokines in certain instances must include induced 

15 expression of the peptides. This is because there is some expec- 
tation that over-expressed pro-inflammatory drugs might in the 
long-term be detrimental to cell growth. Accordingly, constitu- 
tive expression is undesirable, and the peptide in only turned 
on during that phase of the selection process when the phenotype 

20 is required, and then the peptide is shut down by turning off 
the retroviral expression to confirm the effect or ensure long- 
term survival of the producer cells. A large number of both 
inducible and constitutive promoters are known to the skilled 
person . 



25 In addition, it is possible to configure a retroviral vector to 
allow inducible expression of retroviral inserts after integra- 
tion of a single vector in target cells; importantly, the entire 
system is contained within the single retrovirus. Tet-inducible 
retroviruses have been designed incorporating the Self-Inacti- 

30 vating (SIN) feature of 3 * LTR enhancer /promoter retroviral 

deletion mutant (Hoffmann et al., PNAS U.S.A. 93:5185 (1996)). 
Expression of this vector in cells is virtually undetectable in 

M&E Btotech/Ctfl Screen? DK l.TK/l W8-07-20 




32 

the presence of tetracycline or other active analogues. However, 
in the absence of Tet, expression is turned on to maximum within 
48 hours after induction, with uniform increased expression of 
the whole population of cells that harbour the inducible retro- 
5 virus, indicating that expression is regulated uniformly within 
the infected cell population, a similar, related system uses a 
mutated Tet DNA-binding domain such that it bound DNA in the 
presence of Tet, and was removed in the absence of Tet. Either 
of these systems is suitable. 

10 In this manner the primers create a library of fragments, each 
containing a different random nucleotide sequence within a 
scaffold sequence derived from genetic material encoding a 
enzyme modulator. The ligation products are then transformed 
into bacteria, such as E. coli and DNA is prepared from the 

15 resulting library, as is generally outlined in Kitamura, PNAS 
U.S.A. 92: 9146-50 (1995), which is incorporated by reference 
herein . 

Delivery of the library DNA into a retroviral packaging system 
results in conversion to infectious virus. Suitable retroviral 
20 packaging system cell lines include, but are in no way limited 
to, PE501, Bosc23, Psi2, GP+E86, PhoenixEco, PA317, GP+AM12, 
DA(ampho), Bing, FLYA13, ProPak, CRIP, PsiAM, Phoenix-Ampho, 
PG13, and H9 (293GPG) . 

In a preferred embodiment, the cell lines discussed above, and 
25 the other methods for producing retrovirus, are useful for 

production of virus by transient t ransf ect ion . The virus can be 
either used directly or used to infect another retroviral pro- 
ducer cell so as to expand the library. 

Concentration of virus may be done as follows: Generally, retro- 
30 viruses are titred by applying retrovirus-containing supernatant 
onto indicator cells, such as NIH3T3 cells, and then measuring 
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the percentage of cells expressing phenotypic consequences of 
infection. The concentration of the virus is determined by 
multiplying the percentage of cells infected by the dilution 
factor involved, and taking into account the number of target 
5 cells available to obtain a relative titre. If the retrovirus 
contains a reporter gene, such as lacZ , then infection, integra- 
tion, and expression of the recombinant virus is measured by 
histological staining for iacZ expression or by flow cytometry 
(FACS) . In general, retroviral titr»" rror, ~a e .__ 

. i . .„ ^ w ^ i j_ o-iii tveii Lilt; 

10 best of the producer cells do not exceed 10 7 per ml, unless 
concentration is performed on relatively expensive or exotic 
apparatus. However, it is believed that particles as large as 
retrovirus will not move very far by means of brownian motion in 
liquid, fluid dynamics predictions show that much of the virus 
15 never comes in contact with the cells in order to initiate 
infection. However, if cells are grown or placed on a porous 
filter surface and retrovirus are allowed to pass the cells by 
gradual gravitometric flow, a high concentration of virus around 
cells can be effectively maintained at all times. Thus, up to a 
20 ten-fold higher infectivity by infecting cells on a porous 

membrane and allowing retrovirus supernatant to flow past them 
has been seen. This should allow titres of 10 9 after concentra- 
tion . 



Upon isolation/concentration of virus, the substantially identi- 
25 cal target cells are transduced by methods well-known in the 



art 



As mentioned above, fusion of the expression product to at least 
one fusion partner which facilitates expression and/or purifica- 
tion/isolation and/or further stabilization of the expression 
30 product is often desired. 

In eukaryotes, the fusion partner is often a sorting signal or a 
targeting sequence. Such a sorting signal will be in the form of 
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a signal patch or a signal peptide. The well-known function of a 
sorting signal is to effect export of an expressed peptide into 
endoplasmic reticulum, into Golgi apparatus, into lysosomes, 
into secretory vesicles, into mitochondria, into peroxisomes, or 
5 into the nucleus. Of course, also export to the membrane or out 
of the cell are possibilities. Preferably, the sorting signal or 
targeting sequence is selected from the group consisting of 

a nuclear localization signal (NLS) such as Pro-Lys-Lys- 
Lys-Arg-Lys-Val (SV40 large T antigen NLS), Ala-Arg-Arg- 
10 Arg-Arg-Pro (human retinoic acid receptor-p NLS) , Glu-Glu- 

Val-Gln-Arg-Lys-Arg-Gln-Lys-Leu (NFkB p50) , Glu-Glu-Lys- 
Arg-Lys-Arg-Thr-Tyr-Glu (NFkB P 65), and Ala-Val-Lys-Arg- 

Pro-Ala~Ala-Thr-Lys-Lys-Ala-Gly-Gln-Ala-Lys-Lys-Lys-Lys- 
Leu-Asp (Xenopus nucleoplasmin NLS) ; 
15 ~ a membrane anchoring sequence such as those derived from 

CD8, I CAM- 2, IL-8, CD4, and LFA-1, and a lipidation se- 
quence such as a myristylation or a palmi toylation se- 
quence ; 

a lysosomal sorting signal such as a lysosomal degradation 
20 sequence, and a lysosomal membrane sequence 

a mitochondrial localization sequence such as a mitochon- 
drial matrix sequence, a mitochondrial inner membrane 
sequence, a mitochondrial int ermembrane space sequence, 
and a mitochondrial outer membrane sequence; an endoplas- 
25 mic reticulum localization sequence such as the sequence 

from calreticulin (KDEL) and the sequence from adenovirus 
E3/19K protein (LULSRRSFIDEKKMP) ; 

a peroxisome sequence such as the peroxisome matrix se- 
quence from Luciferase; 

30 ~ a f arnesylation sequence such as LNPPDESGPGCMSCKCVLS ; 

- a geranylgeranylation sequence such as LTEPTQPTRNQCCSN; 

a destruction sequence such as RTALGD1GN; and 
a secretory signal sequence such as the secretory signals 
from IL-2, growth hormone, preproinsul in , and influenza HA 

35 protein. 
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EnZ V me adulators useful in the immntinn 

The art has demonstrated the existence of numerous effective 
peptide enzyme activity modulators. Especially the enzyme 
inhibitors are wel 1 -character! zed . Non-limiting examples which 
are all incorporated by reference herein are listed in the 
f ol lowing : 



BPTl/Kunltz family of protease Inhibitors: 

Pancreatic trypsin inhibitor (BPTI) from Bos taurus; 
Spleen trypsin inhibitor from Bos taurus; 
10 Inter-alpha-trypsin inhibitor light chain (bikunin) from Bos 

taurus, Homo sapiens, Meriones unguiculatus , Mesocricetus aura- 
tus, Mus musculus, Sus scrofa, Pleuronectes platessa, and Rattus 
norvegicus, respectively; 

Inter-alpha-trypsin inhibitor from Equus caballus, Ovis aries, 
15 and Capra hircus, respectively; 

Hemolymph trypsin inhibitor A from Manduca sexta; 
Hemolymph trypsin inhibitor B from Manduca sexta; 
Colostrum trypsin inhibitor from Bos taurus; 
Trypstatin from Rattus norvegicus; 
20 Proteinase inhibitor from Tachypleus tridentatus; 
Serum basic protease inhibitor from Bos taurus; 
Chymotrypsin inhibitor SCI-IH from Bombyx mori; 

Male accessory gland serine-protease inhibitor from Drosophila 

funebris; 

25 Protease inhibitor 5 II from Anemonia sulcata; 

Chymotrypsin inhibitors SCI-I and SCI-II from Bombyx mori; 
Proteinase inhibitors SHPI and SHPI-2 from Stoichactis helian- 
thus; 

Isoinhibitor K from Helix pomatia; 
30 Trypsin inhibitor IV from Radianthus macrodactylus; 
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Venom basic protease inhibitors IX and VITIB from Bungarus 
fasciatus / 

Venom basic protease inhibitors I and III from Vipera ammodytes 
ammodytes ; 

5 Venom basic protease inhibitor II from Daboia russelli siamen- 
sis, Hemachatus haemachatus, and Naja nivea, respectively; 
Venom basic protease inhibitors B and E from Dendroaspis polyle- 
pis polylepis ; 

Venom chymot rypsin inhibitor from Naja naja; 
10 Venom basic protease inhibitors I and K from Dendroaspis polyle- 
pis polylepis; 

Venom basic protease inhibitor K from Dendroaspis angusticeps 

Venom trypsin inhibitor from Eristocophis macmahoni and Naja 

naja, respectively; 
15 Protease inhibitor from Sarcophaga bullata; 

Tissue factor pathway inhibitor from Homo sapiens, Oryctolagus 

cuniculus , and Rattus norvegicus, respectively; 

Tissue factor pathway inhibitor 2 from Homo sapiens; 

Uterine pi asmin/ trypsin inhibitor from Sus scrofa; 
20 Protease nexin II (fragment of Alzheimer's disease amyloid A4 

protein) from Homo sapiens, Mus musculus, rattus norvegicus, 

Macaca fascicularis and Macaca mulatta, respectively; 

Amyloid protein 2 from Homo sapiens and Rattus norvegicus, 

respectively; and 
25 Ornithodorin from Ornithodoros moubata, 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 

Serpln family of protease Inhibitors: 

Alpha-l-proteinase inhibitors (alpha-l-ant it rypsins) f rom Equus 
30 caballus, Mus musculus, Cavia porcellus, Oryctolagus cuniculus, 
Bos taurus, Chinchilla villidera, Didelphis marsurpiales 
virginiana , Homo sapiens, Macropus eugenii, Mus caroli, Papio 
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anubis, Sus scrofa, Rattus norvegicus, and Ovis aries, respec- 
tively; 

Alpha-l-antichymotrypsin from Homo sapiens; 

Antithrombin III from Homo sapiens, Bos taurus, Mas musculus, 
5 0 V1S aries, Mesocricetus auratus, and Gallus gallus, respec- 
tively; 

Alpha-2- P las mi n inhibitor (alpha-2-antiplasmin) from Bos taurus 
Homo sapiens, and Mus musculus, respectively; 
Bomapin (Protease Inhibitor 10) f rom Homo sapiens; 
10 Contrapsin from Mus musculus and Cavia porce"lll7-' 

Contrapsin-like protease inhibitors from Rattus norvegicus; 
Factor Xlla inhibitor from Bos taurus; 

Glia derived nexin (protease nexin I) 'from Homo sapiens, Mus 
musculus, and Rattus norvegicus, respectively; 
15 Heparin co-factor II from Homo sapiens, Mus musculus, Oryctola- 
gus cuniculus, and Rattus norvegicus, respectively; 
41 kDa heat shock protein (serine protease inhibitor J6 , from 
Mus musculus and Gallus gallus, respectively; 
Cl-inhibitor from Homo sapiens; 
20 Leukocyte elastase inhibitor from Eguus cacallus, Homo sapiens, 
and Sus scrofa, respectively; 
Protein C inhibitor from Homo sapiens; 
Kallistatin from Homo sapiens; 

Kallikrein-binding protein from Mus musculus and Rattus norvegi- 
cus, respectively; 

Maspin from Homo sapiens, Mus musculus, and Rattus norvegicus 
respectively; 

Plasminogen activator inhibitor-l f rom Bo, taurus. Ho„o sapiens 
Mus nusculus, MusteU vison, and „„ rveJiCTS , respective- 

30 ly; 

Plasminogen activator inhibitor-2 from Homo sapiens, Mus muscu- 
lus, and Rattus norvegicus, respectively; 

Neuroserpin from Homo sapiens, Mus musculus, and Gallus gallus 
Cytoplasmic antiproteinases 1, 2 and 3 from Homo sapiens 
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Antitrypsin from Bombyx mori, respectively; 
Antichymotrypsins I and II from Bombyx mori; 
Alaserpin from Manduca sexta; 

Serine protease inhibitor 2.1 from Rattus norvegicus; 
5 Serine proteinase inhibitor 1 from Cowpox virus, Rabbit pox 
virus, Swine pox virus, Vaccinia virus, and Variola virus, 
respectively; 

Serine proteinase inhibitor 2 from Rabbit pox virus, Vaccinia 
virus, and Variola virus, respectively; 
10 Serine proteinase inhibitor 3 from Vaccinia virus and Variola 
virus, respectively; 
Serpin 1 from Myxoma virus; 

Serine proteinase inhibitor from Halocynthia roretzi; 
Ice inhibitor from Cowpox virus (thiol-protease inhibitor); 
15 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Kazal family of protease Inhibitors : 

Acrosin inhibitors from Bos taurus, Homo sapiens, Sus scrofa, 
and Macaca fascicular is , respectively; 
20 Elastase inhibitor from Anemonia sulcata; 
Ovoinhibitor from Gallus gallus; 
Rhodniin from Rhodnius prolixus; 

Pancreatic secretory trypsin inhibitor from Rattus norvegicus, 
Anguilla anguilla, Bos taurus, Canis familiaris, Homo sapiens, 
25 Sus scrofa and Ovis aries f respectively; 

Pancreatic secretory trypsin inhibitor II from Rattus norvegi- 
cus; 

Double-headed protease inhibitor (submandibular gland) from 
Canis familiaris, Fells silvestris catus, Meles meles, Panthera 
30 leo, and Vulpes vulpes, respectively; 

Trypsin inhibitor from Halocynthia roretzi; 
Tryptase inhibitor from Hirudo medicinalis; 
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Prostatic secretory glycoprotein from Mus musculus ; 
Ovomucoid (third domain) from Aburria pipile, Aepypodius 
arfakianus, Afropavo congensis, Alectoris chukar, Alectoris 
rufa, Anas platyrhynchos , Chloephaga picta, Cyanochen cyanop- 
5 tera, Neochen jubata, Tadorna radjah, Lophonetta specularioides , 
Anas capensis, Aix galericulata , Aix sponsa, Sarkidiornis 
melanotos, Alopochen aegyptiaca, Mergus cucullatus, Anhinga 
novaehollandiae, Anser anser anser, Anser indicus, Anseranas 
semipalmata, Arborophila torqnrnla, Argusianus arg US/ Aychya 
10 americana, Netta rufina, Balearica pavonina, Bambusicola 

thoracica, Bonasa umbellus, Branta canadensis, Anser canagicus, 
Callipepla squamata castanogastric , Callipepla squamata pallida, 
Carpodacus mexicanus, Carpococcyx renauldi, Casuarius casuarius, 
Casuarius bennetti, Cereopsis novaehollandiae, Chauna chavaria, 
15 Chauna torquata , Callus gallus, Chrysolophus amherstiae, 

Chrysolophus pictus. Circus aeruginosus, Colinus virginianus. 
Corpus albus, Corvus monedula, Coscoroba coscoroba, Coturnix 
delegorguei, Coturnix coturnix japonica, Crossoptilon crossopti- 
Ion, Cygnus atratus, Cygnus olor, Oxyura jamaicensis, Oxyura 
20 vittata, Cyrtonyx montezumae, Dacelo novaeguineae , Dendrocygna 
arborea, Dendrocygna arcuata, Dendrocygna autumnalis, Dendro- 
cygna bicolor, Dendrocygna eytoni, Dendrocygna viduata, Dromaius 
novae-hollandiae, Eudromia elegans, Francolinus afer coqui, 
Francolinus erckelii, Francolinus francolinus, Francolinus 
25 pondicerianus, Fulica atra, Gallinula chloropus, Gallirallus 
australis, Callus varius, Geococcyx californianus, Grus 
carunculatus, Grus japonensis, Grus vipio, Anthropoides virgo, 
Guira guira, Guttera pucherani, Gyps coprotheres , Polyboroides 
radiatus, Aquila audax, Necrosyrtes monachus, Haliaeetus albi- 
30 cilia, Haliastur Indus, Larus ridibundus, Larus marinus, Vanel- 
lus spinosus, Leipoa ocellata, Lophu ra bulweri, Lophortyx 
californica, Lophortyx gambelii, Lophura ignita, Lophura diardi, 
Lophura leucomelana, Megapodius freycinet, Meleagris gallopavo, 
Agriocharis ocellata, Nothoprocta cinerascens, Nothoprocta 



MAE Biotectvrdf&crecrC DK 1TK/IW8-O7-20 



40 



perdicaria, Numida meleagris, Acryllium vulturinum, Nycticorax 
nycticorax, Opisthocomus hoazin, Oreortyx pictus, Ortalis vetu- 
la, Pavo cristatus, Pavo muticus, Penelope jacquacu, Penelope 
superciliaris, Perdrix perdrix, Phasianus colchicus colchicus, 
5 Phasianus versicolor, Phalacrocorax sulcirostris , Podargus 
strigoides, Polyplectron bicalcaratum , Polyplectron emphanum, 
Polyborus plancus, Pygoscelis adelie, Phalacrocorax albiventer, 
Rhea americana, Pterocnemia pennata, Rhynchotus ru fescens, 
Rollulus roulroul, Scythrops novaehollandiae , Spheniscus hum- 
10 boldti, Struthio camelus, Syrmaticus mikado, Syrmaticus ree.es- 
ii, Tinamvs major, Tardus merula, Turnix sylvatica, Tympanuchus 
cupido, Centrocercus urophasianus , Tragopan blythii, Tragopan 
caboti, Tragopan satyra, Tragopan temminckii, Lophophorus 
impejanus, Crossoptilon auritum, Crossoptilon mantchuricum , 
15 Lophura edwardsi, Lophura nycthemera, Lophura swinhoei, Pucrasia 
macrolopha, Catreus wallichii, Syrmaticus ellioti, Syrmaticus 
humiae, Syrmaticus soemmerringii , Lagopus leucurus, and Vultur 
gryphus, respectively, 

as well as inhibitors homologous therewith isolated from other 
20 sources than those explicitly mentioned. 

Soybean trypsin Inhibitor (Kunitz) family of protease inhibi- 
tors : 

Aspartic proteinase inhibitor from Solanum tuberosum; 
Cathepsin D inhibitors from Solanum tuberosum; 
25 Wound-induced aspartate proteinase inhibitor from Solanum 
tuberosum; 

Chymotrypsin inhibitor 3 precursor from Psophocarpus 
tetragonolobus; 

Trypsin inhibitor from Adenan thera pavonina; 
30 Trypsin inhibitor from Prosopsis juliflora; 
Trypsin inhibitor from Erythrina caffra; 
Trypsin inhibitor from Erythrina latissima; 
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Chymotrypsin inhibitor from Erythrina variegate; 

Trypsin inhibitors 1A, IB, and 2 from Psopnocarpus tetragonolo- 
bus; 

Trypsin/chymotrypsin inhibitor from Alocasia macrorrhiza; 

5 Trypsin inhibitor from Albizzia julibrissin; 

Trypsin inhibitor from Acacia confusa; 

Trypsin inhibitors A, B and C from Glycine max; 

Trypsin inhibitors KTIland KTI2 from Glycine max; 

Latex serine proteinase inhibitor fm m 

10 Cysteine proteinase inhibitor pcpt a ? c ? 

uuor H - J from Solanum tuberosum; 

Kunitz-type inhibitors 1 and 2 rPKT-1 s o t 

"° z lFKI 1 > from 5oJanum tuberosum; 

Potato inhibitor J family protease inhibitors;.- 

Trypsin/subtilisin inhibitor from Amaranthus caudatus; 
Subtilisin inhibitor from Momordica charantia; 
15 Wound-induced proteinase inhibitor I from Lycopersicon esculen- 
tum, Lycopersicon peruvianum, and Solanum tuberosum, respec- 
tively; 

Subtilisin inhibitors I and II f rora Phaseolus angularis; 
Proteinase inhibitor I f rom Solanum tuberosum; 
20 Chymotrypsin inhibitor 2A (CI-2A) f rom Hordeum vulgare; 
Chymotrypsin inhibitor 2B (CI-2B) from Hordeum vulgare; 
Chymotrypsin inhibitor 1A (CI-1A) from Hordeum vulgare; 
Chymotrypsin inhibitor IB (CI-lB) from Hordeum vulgare; 
Chymotrypsin inhibitor 1C (CI-1C) from Hordeum vulgare; 
25 Chymotrypsin inhibitor I, a, b and c subunits from Solanum 
tuberosum; 

Subtilisin inhibitor from Vicia faba; 

Ethylene-responsive proteinase inhibitor from Lycopersicon 

esculentum; 

30 Proteinase inhibitors I-A and I-B from Nicotians tabacum; 
Eglin C from Hirudo medicinalis; 

Inhibitor of trypsin and Hageman factor from Cucurbita maxima; 
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Trypsin inhibitor MCI-3 from Momordica charantia; and 
Trypsin inhibitor I from Nicotiana sylvestris , 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Bowman -Blrk family of protease Inhibitors : 

Bowman-birk type protease inhibitors from Arachis hypogaea , Coix 
lachryma-jobi , Phaseolus angularis f Glycine max, Triticum aesti- 
vum, Arachis hypogaea , Phaseolus vulgaris, Setaria italica, 
Dolichos axillaris , Lonchocarpus capassa, Oryza sativa, Hordeum 
vulgare, Medicago scutellata , Phaseolus aureus, Phaseolus luna- 
tus, Vicia angusti folia , Vicia faba, and Vigna unguiculata, 
respectively; and 

Wound induced trypsin inhibitors from Medicago sativa and Zea 
mays, respectively, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Squash Inhibitor family: 

Elastase inhibitor from Momordica charantia ; 

Trypsin inhibitors I, II, and III from Lagenaria leucantha; 

Trypsin inhibitor I from Citrullus vulgaris; 

Trypsin inhibitors I, III, and IV from Cucurjbita maxima; 

Trypsin inhibitors I and II from Luffa cylindrica; 

Trypsin inhibitors I and II from Momordica charantia; 

Trypsin inhibitor I from Momordica repens; 

Trypsin inhibitor II from Bryonia dioica; 

Trypsin inhibitors IIB and IV from Cucumis sativus; 

Trypsin inhibitor II from Ecballium elaterium; 

Trypsin inhibitors I, II and III from Cucumis melo var. Conomon; 

Trypsin inhibitors II and III from Cucurbits pepo; and 

Trypsin inhibitor A from Momordica charantia , 
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as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Streptomyces subtlllsln Inhibitor (SSI) family: 

Subtilisin inhibitor from Streptomyces albogriseolus; 
5 Subtilisin inhibitor-like protein-1 from Streptomyces cacaoi; 
Trypsin inhibitor STI2 from Streptomyces longisporvs; 
Subtilisin inhibitor-like protein-2 fium SLzepLomyces rochei; 
Subtilisin inh ibi t or- 1 i ke protein-3 from Streptomyces coeli- 
color; 

10 Subtilisin inhibitor- 1 i ke protein-4 from Streptomyces laven- 
dulae; 

Subtilisin inhibitor- 1 i ke protein-8 from Streptomyces virginiae; 
Protease inhibitor SIL-V3 from Streptoverticil Hum euroc idicus ; 
Protease inhibitor SIL-V1 /SIL-V4 from Streptoverticillium 
15 f lavopersicus ; 

Protease inhibitor SIL-V5 from Streptoverticillium luteoverti- 
cillatus; 

Protease inhibitor SIL-V2 from Streptoverticillium orinoci; 
Alkaline protease inhibitor 2C from Streptomyces griseoincar- 
20 natus; 

Protease inhibitor from Streptomyces lividans; and 
Plasminostrept in from Streptomyces antifibrinolyticus, 
as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

2 5 Bombyx family of protease Inhibitors 

Fungal protease inhibitor F (FPI-F) from Bombyx mori r as well as 
inhibitors homologous therewith isolated from other sources than 
Bombyx mori . 
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Wap-type 9 Four- <±L sulphide Core' Proteinase Inhibitors 

Antileukoproteinase 1 from Homo sapiens and Mus musculus, re- 
spectively; 

Elafin from Homo sapiens and Sus scrofa, respectively; and 



as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Hirudin family of protease Inhibitors 

Hirudins from Hirudo medicinalis and Hirudinaria mani liens is , 
10 respectively, as well as inhibitors homologous therewith iso- 
lated from other sources than those explicitly mentioned. 

Factor Xa Inhibitors 

Antistasin from Haementeria officinalis, Haementeria ghilianii , 
Hirudo medicinal is , Hirudo nipponia, and Hydra magnipapillata r 
15 respectively, as well as inhibitors homologous therewith iso- 
lated from other sources than those explicitly mentioned. 

Ascaris trypsin Inhibitor family 

Chymotrypsin/elastase inhibitors from Ascaris suum and 
Trypsin inhibitor from Ascaris suum as well as inhibitors homo- 
20 logous therewith isolated from other sources than Ascaris suum. 

Cystatln family of protease Inhibitors 

Leukocyte cysteine proteinase inhibitors 1 and 2 from Sus scro- 



5 



Chelonianin (basic protease inhibitor) from red sea turtle, 



fa; 



Stefins 1, 2 and 3 from Mus musculus; 
25 Cystatins Al, A5, A8 and B from Sus scrofa; 
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Cystatins A and B from Bos taurus; 
Stefin C from Bos taurus; 
Cystatin B from Ovis aries; 

Cystatins A, C, D, M, SN, S and SA from Homo sapiens; 
5 Cystatins B and C from Mus musculus; 

Cystatins C and S from Rattus norvegicus; 

Cystatins C from Macaca mulatta and Saimiri sciureus, respec- 
tively; 

Cvstatin a 1 nh^ ( eoi ^m=> i k -; _ »_ . . , r 

10 Rattus norvegicus ; 

Cystatin B (liver thiol proteinase inhibitor) from Homo sapiens 
and Rattus norvegicus, respectively; 

Cystatin (colostrum thiol proteinase inhibitor) from Bos taurus; 

Cystatins from Gallus gallus and Coturnix coturnix japonica, 
15 respectively; 

Cystatin from Cyprinus carpio; 

Cystatin from Bitis arientas; 

Cystatin 1 from Zea mays; 

Oryzacystatin-I from Oryza sativa; 
20 Oryzacystatin-II from Oryza sativa; 

Cystatin A from Helianthus annuus; 

Cystatin B from Helianthus annuus; 

Cystatin from Vigna unguiculata; 

Onchocystatin from Onchocerca volvulus; 
25 Cysteine proteinase inhibitor from Solanum tuberosum; 

Cystatin WCPT-3 from Wisteria floribunda; 

Cystatin from Glycine max; and 

Kininogens from Bos taurus, Homo sapiens and Rattus norvegicus, 
respect ively , 

30 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Calpain family of cysteine protease inhibitors 

Calpastatin from Bos taurus, Cercopithecus aethiops, Homo sapi- 
ens, Mus musculus, Sus scrofa, Oryctolagus cuniculus, Rattus 
norvegicus , and Ovis aries, respectively, 
5 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Tissue inhibitor of metalloproteinases family 

Metalloproteinase inhibitor 1 from Bos taurus, Homo sapiens, Mus 
musculus, Papio cynocephalus , Sus scrofa, Oryctolagus cuniculus, 
10 Rattus norvegicus , and Ovis aries, respectively; 

Metalloproteinase inhibitor 2 from Bos taurus, Homo sapiens, Mus 
musculus, Rattus norvegicus, and Gallus gallus, respectively; 
and 

Metalloproteinase inhibitor 3 from Bos taurus, Homo sapiens, Mus 
15 musculus, Rattus norvegicus, and Gallus gallus, respectively, 
as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Carboxypeptidase A inhibitors 

Carboxypeptidase A inhibitor from Ascaris suum, as well as 
20 inhibitors homologous therewith isolated from other sources than 
Ascaris suum. 

Metallocarboxypeptidase inhibitors 

Metallocarboxypeptidase inhibitors from Lycopersicon esculentum 
and Solanum tuberosum, respectively, as well as inhibitors 
25 homologous therewith isolated from other sources than those 
explicitly mentioned. 
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Angiotensin- converting enzyme Inhibitors 

Angiotensin-converting enzyme inhibitor from Thunnus albacares; 
Angiotensin-convert ing enzyme inhibitors from Bothrops insula- 
ris; 

5 Angiotensin-converting enzyme inhibitors from Bothrops jarara 
Angiotensin-converting enzyme inhibitor from Agkistrodon halys 
blomhoff i ; 

An giotensin-convertiiiy enzyme inhibiLor from Agkistrodon halys 
pallas; and 

10 Angiotensin-converting enzyme inhibitor from Vipera aspis, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Protease Inhibitors not belonging to a specific family 

Ecotin from Escherichia coli; 
15 Metalloproteinase inhibitor from Streptomyces nigrescens; 

Proteinase inhibitor from Erwinia chrysanthemi; 

Proteinase inhibitor from Pseudomonas aeruginosa; 

Protease A inhibitor 3 from Saccharomyces cerevisiae; 

Protease B inhibitors 1 and 2 from Saccharomyces cerevisiae; 
20 Major pepsin inhibitor PI-3 from Ascaris suum; 

Intracellular proteinase inhibitor from Bacillus subtilis; 

Proteinase inhibitor PTI from Solanum tuberosum; 

Proteinase inhibitor PCI-I from Solanum tuberosum; 

Proteinase inhibitor IIA from Solanum tuberosum; 
25 Proteinase inhibitor IIB from Solanum tuberosum; 

Proteinase inhibitors A and B from Sagittaria sagitti folia ; 

Proteinase inhibitor from Solanum melongena; 

Trypsin inhibitor from Brassica napus; 

Trypsin inhibitor 2 from Sinapis alba; 
30 Trypsin inhibitor from Zea mays; 

Trypsin inhibitor from Sinapis arvensis; 
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Trypsin inhibitor from Trichosanthes kirilowii; 

Wound-induced proteinase inhibitor II from Lycopersicon esculen- 
tum; 

Protease inhibitors LCMI I and PMP-D2 from Locusts migratoria; 
5 Protease inhibitor from Bacillus brevis; 

Marinostatins C-l, C-2 and D from Alteromonas sp. ; and 
Host protease inhibitor from bacteriophage T4, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

10 Cereal alpha -amylase /trypsin Inhibitor family 

Alpha amylase/trypsin inhibitor CM1 from Tritlcum aestivum; 

Alpha amylase/trypsin inhibitor CM2 from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CM3 from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CM16 from Triticum aestivum; 
15 Alpha amylase/trypsin inhibitor CM17 from Triticum aestivum; 

Alpha amylase inhibitor 0.19 from Triticum aestivum; 

Alpha amylase inhibitor 0.28 from Triticum aestivum; 

Alpha amylase inhibitor 0.53 from Triticum aestivum; 

Alpha amylase inhibitor WDAI-3 from Triticum aestivum; 
20 Alpha amylase/trypsin inhibitor CM A from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMB from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMC from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMD from Hordeum vulgare; 

Alpha amylase inhibitor CME from Hordeum vulgare; 
25 Alpha amylase inhibitor BMAI-1 from Hordeum vulgare; 
Alpha amylase inhibitor BDAI-1 from Hordeum vulgare; 
Alpha amylase/trypsin inhibitor from Eleusine coracana; and 
Trypsin/factor Xlla inhibitor from Zea mays, 

as well as inhibitors homologous therewith isolated from other 
30 sources than those explicitly mentioned. 
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Alpha -amylase/ trypsin inhibitor homologous to thaumatin 

Alpha-amylase/trypsin inhibitor from Zea mays as well as inhibi- 
tors homologous therewith isolated from other sources than Zea 
mays . 

5 Alpha-amylase/subtilisin inhibitor family 

Alpha-amvlase/subt i 11 sin inhiKU^. rr 7 

Alpha-amylase/subtilisin inhibitor from Triticum aestivum; and 
Alpha-amylase/subtilisin inhibitor from Oryzae sative, 
as well as inhibitors homologous therewith isolated from other 
10 sources than those explicitly mentioned. 

Inhibitors of insect alpha -amylases 

Small protein inhibitor of insect alpha-amylases 1, 2 and 3 from 
Sorghum bicolor milo as well as inhibitors homologous therewith 
isolated from other sources than Sorghum bicolor milo. 

15 Inhibitors of mammalian alpha-amylases derived from Streptomyces 
species 

Alpha-amylase inhibitor HAIM I from Streptomyces griseosporeus ; 
Alpha-amylase inhibitor PAIM I from Streptomyces olivaceoviri- 
dis; 

20 Alpha-amylase inhibitor HAIM II from Strep to/nyces griseosporeus; 
Alpha-amylase inhibitor PAIM II from Streptomyces olivaceoviri- 
dis; 

Alpha-amylase inhibitor AI-3688 from Streptomyces aureofaciens ; 
Alpha-amylase inhibitor Z-2685 from Streptomyces rochei; and 
25 Alpha-amylase inhibitor HOE-4 67A from Streptomyces tendae, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Trehalase Inhibitors 

Trehalase inhibitor from Periplaneta americana as well as 
inhibitors homologous therewith isolated from other sources. 

Polygalacturonase Inhibitors 

5 Polygalacturonase inhibitors from Phaseolus vulgaris and Pyrus 
communis as well as inhibitors homologous therewith isolated 
from other sources than those explicitly mentioned. 

Tucosyl transferase Inhibl tors 

Fuctinins from Rattus norvegicus as well as inhibitors homolo- 
10 gous therewith isolated from other sources. 

Protein kinase C inhibitors 

14-3-3 Protein beta/alpha from Bos taurus, Ovis aries, Homo 
sapiens , and Rattus norvegicus , respectively; 
14-3-3 Protein epsilon from Mus musculus , Ovis aries, Homo 
15 sapiens, and Rattus norvegicus , respectively; 

14-3-3 Protein eta from Bos taurus, Mus musculus , and Rattus 
norvegi cus, respectively; 

14-3-3 Protein gamma from Bos taurus, Ovis aries, and Rattus 

norvegicus, respectively; 
20 14-3-3 Protein zeta/delta from Bos taurus, Mus musculus , Ovis 

aries, Homo sapiens , and .Rattus norvegicus, respectively; 

Hint protein from Bos taurus, Rattus norvegicus , Homo sapiens, 

Mus musculus , and Oryctolagus cuniculus , respectively; and 

14 kDa zinc-binding protein from Brassica juncea and Zea mays, 
25 respectively, 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 
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cAMP -dependent protein kinase Inhibitors 

cAMP-dependent protein kinase inhibitor (muscle/brain form) 
from Homo sapiens, Oryctolagus cuniculus , Mus musculus, and 
Rattus norvegicus, respectively, and 
5 cAMP-dependent protein kinase inhibitor (testis form) from Mus 
musculus , and Rattus norvegicus, respectively, 

as well as inhibitors homologous therewith isolated from other 
sources than thcst: explicitly mentioned. 

Cyclic nucleotide phosphodiesterase inhibitor 

10 Cyclic nucleotide phosphodiesterase inhibitor from Dictyostelium 
discoideum as well as inhibitors homologous therewith isolated 
from other sources. 

Protein phosphatase inhibitors 

Protein phosphatase inhibitor 1 from Homo sapiens, Oryctolagus 
15 cuniculus , and Rattus norvegicus , respectively; 

Protein phosphatase inhibitor 2 from Homo sapiens, Oryctolagus 

cuniculus , and Rattus norvegicus , respectively; 

Heat-stable protein phosphatase 2a inhibitor I1PP2A from Bos 

taurus and Homo sapiens, respectively; and 
20 Phosphatase RAPA inhibitor from Bacillus subtilis, 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 

TCD/MRS6 family of GDP dissociation inhibitors 

Secretory pathway GDP dissociation inhibitor from Saccharomyces 
25 cerevisiae; 

Rab GDP dissociation inhibitor alpha from Bos taurus, Homo 
sapiens, Mus musculus , and Rattus norvegicus, respectively; 



M&E BioiecWCdl Screen: DK I TK/IW8-07-20 



52 

Rab GDP dissociation inhibitor beta from Homo sapiens , Mus 
musculus , and Rattus norvegicus , respectively; 

Rho GDP-dissociat ion inhibitor 1 from Bos taurus, Homo sapiens, 
Caenorhabditis elegans and Cavia porcellus , respectively; 
5 Rho GDP-dissociat ion inhibitor from Saccharomyces cerevisiae ; 
Rho GDP-dissociat ion inhibitor 2 from Homo sapiens and Mus 
musculus , respectively; and 

Rho GDP-dissociat ion inhibitor from Mus musculus , 
as well as inhibitors homologous therewith isolated from other 
10 sources than those explicitly mentioned. 



ATPase Inhibitors 



Mitochondrial ATPase inhibitors from Bos taurus, Caenorhabditis 
elegans, Pichia jadinii, Sus scrofa, Rattus norvegicus , and 
Saccharomyces cerevisiae, respectively, and 
15 Sodium/potassium ATPase inhibitor from Sus scrofa f 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 



Phospholipase A2 inhibitory proteins 



Annexin I from Bos taurus, Homo sapiens , Mus musculus , Sus 
20 scrofa, Rattus norvegicus , Oryctolagus cuniculus , Cavia cutleri, 
Gallus gallus, and Columba livia, respectively; 
Annexin III from Homo sapiens and Rattus norvegicus , respec- 
t i ve 1 y ; 

Annexin V from Homo sapiens , Mus musculus , Rattus norvegicus , 
25 and Gallus gallus, respectively; 

Uteroglobulin from Oryctolagus cuniculus and Lepus capensis , 
respectively; and 

Phospholipase A2 inhibitor from Trimeresurus flavoviridis , 
as well as inhibitors homologous therewith isolated from other 
30 sources than those explicitly mentioned. 
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Barstar from Bacillus amyloliquefaciens; 

Ribonuclease inhibitor from Homo sapiens, Sus scrofa, and Rattus 
norvegicus , respectively, as well as inhibitors homologous 
therewith isolated from other sources than those explicitly 
mentioned . 

RNA p&lyi&&2.ci&& lulillsl tors 

Bacterial RNA polymerase inhibitors 
T7 as well as inhibitors homologous 
other sources. 

DNA- entry nuclease Inhibitors 

Competence protein J from Bacillus subtilis as well as inhibi- 
tors homologous therewith isolated from other sources. 

Beta- lactamase Inhibitors 

Beta-lactamase inhibitor from Streptomyces clavuligerus as well 
as inhibitors homologous therewith isolated from other sources. 

To this list of enzyme inhibitors can be added functionally 
related inhibitors such as the inhibitor of calcium transport 
seminalplasmin from Bos taurus and Mus musculus , respectively, 
as well as inhibitors homologous therewith isolated from other 
sources . 

To the above list of suitable scaffold molecules to be used in 
the invention should be added other agents having an effect on 
the activity of enzymes. One interesting candidate is thioredox- 
in . 



from Bacteriophages T3 and 
therewith isolated from 
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All of the above-listed scaffold molecules can be substituted 
with an effective part of the complete molecule. 



Properties of the scaffold molecule 



Since the peptide library presumably can reach every compartment 
5 of a cell, it is beneficial if the scaffold enzyme inhibitor 
protein is not too large, and that it is stable towards e.g. 
proteolytic attack and insensitive to the reducing environment 
inside eukaryotic cells. Hence, its function should preferably 
not be dependent on the formation of disulphide bridges, since 

10 these are not formed in the cytosol or nucleus of such cells. In 
addition the scaffold protein should contain one or more exposed 
loops in which peptides can be inserted without markedly chan- 
ging the structure or stability of the protein. On this basis 
the chymot rypsin inhibitor 2A (cf. the discussion below) is a 

15 suitable scaffold but other scaffolds such as the examples 
mentioned below could also be used. 



The members of the peptide library introduced into target cells 
can effect change of the target cells' phenotype either by 
direct or indirect ( t ransdominant ) interaction. However, the 
20 method of the invention renders possible the identification of 
not only the peptide or ribonucleic acid sequence which is 
responsible for the direct or indirect effect on phenotype, but 
also allows for the purification, isolation and identification 
of the target molecule with which the peptide interacts. 



25 In the present specification the barley-derived chymotrypsin 
inhibitor 2A has been used by way of example, mainly because 
this protein is very well -character i zed and extremely stable. It 
will be understood, however, that the use of this enzyme inhibi- 
tor in the Examples in no way should limit the scope of the 

30 present invention. 
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Barley chymotrypsin inhibitor 2A belongs to a large family of 
homologous protease inhibitors mainly found in plants. This 
family includes barley chymotrypsin inhibitors 1A, IB, 2A and 
2B, potato inhibitor I , wound-induced tomato inhibitor I, 
5 ethylene- responsive tomato inhibitor, wild tomato fruit inhibi- 
tor I, a subtilisin inhibitor from broad bean, adzuki bean 
subtilisin inhibitor, pumpkin trypsin/Hageman factor inhibitor, 
bitter gourd inhibitor, protoplast-specific trypsin inhibitor 
from Nicotiana sylvesfris. fnhacco subtilisin inhibitor,- ama- 
10 ranth trypsin/subt ilisin inhibitor, and beach canavalia subtili- 
sin inhibitor. The only member of this inhibitor family that is 
of non-plant origin is the leech elas t ase/ca thepsin G inhibitor 
eglin C. 

CI-2A was originally purified from the endosperm of the high-ly- 

15 sine barley Hiproly and shown to be a tight binding inhibitor of 
the microbial subtilisins Carlsberg and Novo as well as of 
chymotrypsin. The N-terminal amino acid residue in CI-2A has a 
blocked amino group as direct amino acid sequencing was unsuc- 
cessful. However, most of the amino acid sequence of CI-2A has 

20 been determined at the protein level. In addition, it was shown 
that CI-2A purified from barley is N-terminally processed either 
during synthesis and storage in the endosperm or in the process 
of purification. The absence of the 17 N-terminal amino acid 
residues does not influence the complex formation of CI-2A with 

25 subtilisins. Combining the results of amino acid sequencing and 
cDNA sequencing it has been deduced that CI-2A consists of 83 
amino acid residues in a single polypeptide chain containing no 
disulfide bonds. The blocked N-terminal amino acid residue is 
serine. The reactive site in CI-2A has been determined to be the 

30 Met59-Glu60 peptide bond and the residues in the region Ile56- 
Arg62 have been demonstrated to be involved in the intermolecu- 
lar contacts between inhibitor and protease . 
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The three-dimensional structures of uncomplexed CI-2A as well as 
of CI-2A complexed with subtilisin Novo are known from X-ray 
crystallography. The three-dimensional structure of CI-2A has 
also been determined using NMR spectroscopy, revealing that the 
5 reactive loop of CI-2A is dynamic. CI-2A consists of a single a- 
helix docking against four p-strands. The surface loop stretches 
across the free side of the sheet and is composed of eight 
residues: Gly54-Tyr61. In contrast to most enzyme inhibitors, 
CI-2A lacks disulfide bonds as well as glycosylat ion sites. In 

10 the structures determined, only the 64 C-terminal amino acid 
residues are defined (L20-G83); this truncated version retains 
the functionality of the native protein. Comparing the complexed 
form with the two uncomplexed forms of CT-2A reveals few diffe- 
rences. The most notable difference is that the reactive site 

15 loop seems to have a less ordered structure in the uncomplexed 
forms than in the complexed form. The three-dimensional struc- 
ture of CI-2A in complex with subtilisin Novo has also been 
compared to the three-dimensional structure of eglin C in com- 
plex with subtilisin Carlsberg. The two homologous inhibitors 

20 have highly similar secondary and tertiary structures. 

Recombinant variants of N-terminally truncated CI-2A 
(CI-2A (L20-G83) ) and CI-2A with an N-terminal Asp-Pro extension 
have been widely used to study the folding and stability of 
CI-2A. The structure of CI-2A in complex with subtilisin Novo 
25 has revealed that the number of intermolecular contacts between 
inhibitor and protease in the P4-P1 region ( I Ie56-Met 59 ) of the 
inhibitor are much larger than in the Pl f -P3 f region 
(Glu60-Arg62) . 

Further aspects of the invention 

30 Having identified a modulator according to the above-detailed 
methods of the invention it is normally of interest to provide 
large quantities thereof for the purposes of further research 
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and development, including possible identification of the target 
molecule with which the modulator physically interacts. 

Therefore, the invention also pertains to a method for the 
preparation of a replicable expression vector, the method com- 
prising the steps of identifying a modulator by the methods of 
the invention, and subsequently 

- isolating or synthesizing a nucleic acid sequence which 
encodes the modulator and 

- engineering a replicable expression vector comprising an 
operon which comprises, in the 5'-3' direction and in 
operable linkage, 1) a promoter for driving expression of 
the nucleic acid sequence, 2) optionally a nucleotide 
sequence encoding a leader peptide, 3) the nucleic acid 
sequence, and 4) a termination signal. 

Such methods are widely known in the field of genetic engineer- 
ing and molecular biology. The skilled person will find suitable 
guidance in e.g. Sambrook J, Fritsch EF, Maniatis T. 1989. 
"Molecular cloning: A laboratory manual", 2nd ed . Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y. 

In this part of the invention it is preferred that the promoter 
is inducible; constitutive promoters are not excluded though. 
Depending on the choice of host cell to carry and express the 
vector, the promoter is selected from the group consisting of a 
bacterial promoter, a fungal promoter such as a yeast promoter, 
and a mammalian promoter. The vector can be in the form of a 
plasmid, a phage, a cosmid, a minichromosome , or a virus, again 
depending on choice of host cell and other considerations apply- 
ing for the specific case. However, it is most preferred that 
the expression vector is capable of being integrated into the 
genome of a suitable host cell, since the expression therein 
will then be more stable over time than is the case with non- 
chromosomal transformation of the host cell. 
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Well-known vector systems are based on bacterial plasmid pBR322, 
A-phage, and yeast plasmid YRp7, but other suitable and feasible 
choices are known to the skilled person. 

After having provided a suitable expression vector as outlined 
5 above, it is preferred that the modulator is produced by trans- 
forming a suitable host cell with an expression vector prepared 
as described above. Such a host cell can be bacterial (e.g. E. 
coll, Bacillus subtilis, or any other suitable bacterial host 
cell), a fungus (e.g. a yeast cell such as Saccharomyces 
10 cerevisiae or Piccia pastoris) or a plant, insect or mammalian 
cell (which can be any of the above-discussed cells or cell- 
types suitable for use as the "substantially identical cells" in 
the method of the invention) . 

After having provided a producer cell line as described above it 
15 is now possible to produce the modulator of the invention by 
growing the transformed cell prepared as described above in a 
culture medium under conditions which facilitates expression by 
the cell of the randomly modified nucleotide sequence, and 
subsequently harvesting the expression product from the cell 
20 and/or the culture medium. Alternatively, the modulator can be 
produced by synthesizing the modulator by means of chemical 
synthesis on the basis of the sequence determined in step (e) . 
In the case that the modulator is a peptide, the well-known 
techniques of solid- or liquid-phase peptide synthesis can be 
25 employed and also if the modulator is a ribonucleic acid, me- 
thods for synthetic production thereof are readily available. 

An important part of the invention pertains to the isola- 
tion/identification of the target biomolecules which are enga- 
ging the modulator, the identification, isolation and production 
30 of which is described above. Hence an important part of the 

invention pertains to a method for isolating a target biomole- 
cule, the method comprising providing a modulator according to 
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the methods described herein and subsequently using the modula- 
tor as an affinity ligand in an affinity purification step so as 
to isolate the target biomolecule from a suitable sample. The 
affinity purification step can e.g. be an affinity chromato- 
5 graphic step, an affinity mass spectrometry step, or a co- 
immunoprecipitat ion step. However, any suitable method for 
affinity-based isolation/purification can be employed. 

Alternatively, the modulator ran he used as a probe against a 
cDNA library derived from the substantially identical cells or 
10 as bait in a two- or three-hybrid system (bacterial, fungal or 
mammalian) . 

It is preferred that the target biomolecule is a peptide or a 
nucleic acid, since this also allows of sequence determination 
thereof . 

15 The potential use of such a target biomolecule is to employ it 
in a drug-development program. For this purpose it is often 
useful to resolve the 3-dimensional structure of the target 
biomolecule (by means of methods available, e.g. X-ray diffrac- 
tion studies, NMR analysis, etc) . 

20 Having isolated a target biomolecule as described above, the 
invention allows for the rational selection of a chemical com- 
pound to be used as a lead compound in drug development, the 
method comprising the steps of 

- assaying a library of chemical compounds for interaction 
25 with a target biomolecule which has been de novo isolated 

according to the methods of the, and 

- selecting compounds which interact significantly with the 
target biomolecule . 
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This approach can be refined by initially identifying the mem- 
bers of the library by methods of structure-based or non-struc- 
ture based computer drug-modelling. Suitable non-structure based 
methods are disclosed in US 5,307,287 and US 5,025,388; a method 
5 known as CoMFA) . An alternative is HASL {Hypothetical Active 

Site Lattice; Hypothesis Software) . Both these methods are based 
on 3D-QSAR. A feasible structure-based approach is e.g. dis- 
closed in WO 95/06293. 



Finally, a very important part of the present invention pertains 
10 to a method for the preparation of a medicinal product, the 
method comprising the steps of 

a) selecting a lead compound by the methods of the invention 
described above, 

b) performing pre-clinical tests with the lead compound in 
15 order to assess the suitability thereof as a medicinal 

product, 

c) entering, if the lead compound is deemed suitable in step 
(b) , clinical trials using the lead compound in order to 
obtain market authorization for a medicinal product in- 

20 eluding the lead compound as a pharmaceutically active 

substance, and 

d) upon grant of a market authorization, admixing the lead 
compound with a pharmaceutically acceptable carrier, 
excipient or diluent and marketing the thus obtained 

25 medicinal product. 



The above-outlined method should of course take into conside- 
ration all necessary requirements in order to meet GCP and GMP 
standards. At any rate, this method is completely depending on 
the initial provision of the modulator identified according to 
30 the invention. 
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LEGEND TO THE FIGURE 

Fig. 1: Functional expression of CI-2A in NIH3T3 cells. 
Subtilisin was incubated with increasing amounts of the indi- 
cated cell extracts and subsequently assayed for proteolytic 
5 activity. Each reaction with a given extract concentration was 
determined in triplicate and the shown velocities are based on 
the rue a n vaiuco - 

PREAMBLE TO EXAMPLES 

In the following, the present invention is illustrated by way of 
10 example wherein CI-2A is used as starting point for the Cell- 
Screen technique adapted according to the invention so as to 
allow intracellular expression of a scaffold protein inhibitor 
which is randomly modified in the active site. This example is 
non-limiting, in the sense that other suitable protein inhibi- 
15 tors of enzymes could be used instead of CI-2A. The skilled 
person can readily perform the necessary substitutions and 
modifications necessary in order to apply the principles exem- 
plified below using other protein inhibitors of enzymes as 
starting point. 

2 0 EXAMPLE 1 

Expression of CI-2A truncate in NIH-3T3 cells 
Construction of CMVbipep/CI-2A 

The CMVbipep/CI-2A construct was created by insertion of a CI-2A 
derived cDNA fragment that encodes amino acids 21-84 from the 
25 wild type CI-2A sequence (cf. the numbering in SEQ ID NO: 1) and 
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an N-terminally located methionine residue into the BamHI and 
Xhol sites within the CMVbipep retroviral vector (a standard 
retroviral vector described in WO 96/38553, cf. Fig. 1 therein). 
The 5117 base pair BamHI/XhoI fragment of CMVbipep was gel 
5 purified and extracted using a QIAquick gel extraction kit 
(Qiagen) according to the manufacturer's instructions. The 
fragment of the CI-2A gene was amplified from a plasmid contain- 
ing the entire CI-2A encoding DNA sequence by use of the primers 



in a standard polymerase chain reaction (PCR) . The amplified DNA 
fragment was purified using the QUAquick PCR purification kit 
(Qiagen) according to the manufacturer's protocol and subse- 
quently digested with BamHI and Xhol and purified as described 
15 above. An approximately 10-fold molar excess of the CI-2A frag- 
ment was incubated together with the CMVbipep derived fragment 
under standard ligation conditions (New England biolabs) . Posi- 
tive clones were identified and confirmed by DNA sequencing. 

Transduction of NIH-3T3 cells 

20 CMVbipep and CMVbipepCI-2A were transfected into the BOSC pack- 
aging cell line using a CaP0 4 co-precipitation method. BOSC 
packaging cells (also known as BOSC23 cells, cf. WO 94/19478) 
were diluted to 5 x 10 5 cells/cm 2 and incubated at standard cell 
culturing conditions in Dulbecco's modified eagle media { DMEM, 

25 GibcoBRL) including supplements (10% fetal calf serum, L-gluta- 
mine, 50 mg/ml penicillin/streptomycin (complete DMEM, Gibco- 
BRL) . To prepare the BOSC cells for transfection they were 
washed once in complete DMEM 2 hours before the transfection. 
The CaP0 4 co-precipitated mixtures were prepared by diluting 10 

30 mg CMVbipep/CI-2A or CMVbipep with 10 mg salmon sperm DNA in 450 
ml ddH 2 0 and adding 50 ml 2 . 5 M CaCl 2 . These solutions were 



5'- 



cgggatccatgaaGAcagagtggccagaG and 
CGCTCGAGTCAGCCGACCCTGGGGACCT 



10 



5' - 
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slowly added to 500 ml 2 * HEPES -buffered saline pH 7.05 (280 mM 
NaCl, 1.5 mM Na 2 HPO,, 50 mM HEPES /NaOH pH 7.05) while gently 
shaking followed by two to five minutes of incubation at 25°C 
before adding the precipitate to the prepared BOSC cells. After 
5 24 hours of incubation, the cells were washed twice in PBS (137 
mM NaCl, 2.7 mM KC1, 8.3 mM Na 2 HP0 4 1.4 mM KH 2 POJ and further 
cultivated in 10 ml DMEM including supplements for another 24 
hours. The media from the transfected BOSC cells were collected 
and diluted from 10 to 10 b fold in complete DMRM including £ 

10 mg/ml Polybene. N1H3T3 cells that had been diluted into 10 4 

cells/cm 2 and incubated at standard cell culture conditions for 
24 hours in complete DMEM were incubated with the virus-contain- 
ing media for 24 hours at standard conditions. After this incu- 
bation period the transduced cell were washed twice in PBS and 

15 incubated in DMEM including supplements containing 0.6 mg/ml 
Geniticin (GibcoBRL) . 



Prepara tion of cell extracts 



The CMVbipep/CI-2A or CMVbipep transduced NIH3T3 cell were each 
harvested from two confluent T175 cell culturing bottles by 

20 incubation with 5 ml 0.5 * Tryps in-EDTA solution 

(GibcoBRL) /plate for 10 min at 37°C. The recovered cells were 
diluted 1:1 in complete DMEM and subsequently collected by 
centrifugation, washed twice in 10ml DMEM and twice in 1 ml PBS. 
Finally, the cells were suspended 100 ml PBS. The cells were 

25 rendered permeable by three cycles of freezing in liquid nitro- 
gen and thawing by incubation at 37°c and subsequently centri- 
fuged at 20000 * g for 15 minutes cell to remove cell debris. To 
inactivate endogenous protease activity the extracts were incu- 
bated at 65°C for 15 minutes and re-cent ri fuged . 

30 Subt ilisin assay 
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The indicated cell extracts were diluted to the indicated 
amounts in 10 ml PBS and added to 25 ml 0.1 M Tris/HCl pH 8.6 
containing 5 * 10" 9 M Subtilisin Carlsberg (Sigma). After incuba- 
tion at 25°C for 30 minutes, 25 ml 0.1 M Tris/HCl pH 8 . 6 con- 
5 taining 5 mM N-succinyl-Ala-Ala-Pro-Phe-p-Ni t roanilide (Sigma) 
was added and the substrate conversion was followed by measuring 
the absorbance at 405 nm every two minutes until the reaction 
had reached exhaustion. 

Results/discussion 

10 A potential scaffold for intracellular presentation of a peptide 
library expressed by virtue of a retroviral vector should not 
have any significant effects on neither the retroviral replica- 
tion cycle nor the viability of mammalian cells. Furthermore, it 
must retain a stable structure. In order to test whether CI-2A 

15 fulfill these requirements CMVbipep/CI-2A was constructed con- 
sisting of the cDNA that encodes the CI-2A amino acids 21-84 
inserted into the CMVbipep retroviral vector. This CI-2A expres- 
sion construct was in parallel with CMVbipep transiently 
transfected into BOSC packaging cells and the produced viruses 

20 were transduced into NIH-3T3 cells. 

The titer of both CMVbipep and CMVbipep/CI-2A, determined as 
geneticin resistant colonies, were in both cases found to be 10 s 
infectious virions/ml media. This demonstrated that the presence 
of the CI-2A gene within the viral genome did not affect the 
25 viral replication cycle to any detectable extent. 

To detect whether any expression of CI-2A occurs in the trans- 
duced cells the potential of cell extracts obtained from the 
NIH3T3 cells transduced with either CMVbipep or CMVbipep/CI-2A 
to inhibit the proteolytic activity of subtilisin was deter- 
30 mined. This was done by incubation of various amounts of the 
cell extracts with subtilisin and subsequently detecting the 
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residual protease activity using a chromogenic substrate (cf. 
Fig. 1). A complete block of subtilisin activity was observed 
when the reaction was performed in the presence of between 10 
and 0.2 ml of the extract derived from the CMVbipep/C I -2A trans- 
5 duced NIH3T3 cells whereas approximately 50% of the subtilisin 
activity was maintained when 0.1 ml extract was present. If the 
amount of CMVbipep/CI -2A extract was further decreased, the 
subtilisin activity was not detectably reduced. The specificity 
of the CI-2A mediated inhibitory effect was demonstrated by the 
10 observation that even as much as 10 ml of extract from CMVbipep 
transduced control NIH3T3 cells had no effect on the subtilisin 
act i vi ty . 

The data presented demonstrates that CI-2A does not affect the 
viral packaging and integration processes and that it can be 

15 expressed in detectable amounts from the CMVbipep vector. The 

ability of CI-2A to inhibit the proteolytic activity of subtili- 
sin furthermore suggests that CI-2A adopts a functional struc- 
ture within mammalian cells. This is of great importance for the 
use of CI-2A as a scaffold since it then becomes possible to 

20 utilise the impressive amount of structural information avail- 
able on CI-2A to ensure that a peptide library is properly 
presented. The possibility of expanding the CMVbipep transduced 
NIH3T3 cell indicates that the presence of CI-2A does not affect 
to cell viability, further supporting that CI-2A is a suitable 
25 scaffold for intracellular presentation of peptides. 



EXAMPLE 2 

Chymotrypsin inhibitor 2A (CI-2A) as scaffold in the CellScreen 
technology 

As demonstrated in Example 1, CI-2A can be expressed in a func- 
tional form in mammalian cells. The provision of a functional 
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system for displaying randomized peptide sequence using CI-2A as 
a scaffold is thus relatively uncomplicated to envisage. 

It has been shown that insertions of 7, 9, 11 and 13 residues 
between the Met40 and Glu41 (corresponding to the Met59 and 
5 Glu60 in the native molecule) in CI-2A have a minimal effect on 
the stability and folding rates of the protein. The loop there- 
fore seems to be suitable for insertions of residues for the 
purposes of the present invention. Moreover, the truncated form 
of CI-2A has been found to fold by interactions of key residues 
10 in the - and C-terminal domains of the protein irrespective of 
whether the loop is cleaved. 

CI-2A is thus an extremely stable protein that has the advantage 
of being small, having no disulfide bridges, no glycosylat ion 
sites and best of all - a loop of eight amino acids that can be 
15 manipulated without any major stability loss. To this should be 
added that CI-2A is of plant origin, and therefore its use in 
prokaryotic and mammalian systems will not be obscured by the 
presence of highly homologous proteins. 

Especially the lack of disulphide bridges qualifies CI-2A as a 
20 scaffold for displaying diversity in the nucleus and/or cytosol- 
ic compartment of the cell. As a scaffold, the 64 residue ver- 
sion of CI-2A could be used, since the N-terminal 19 amino acid 
residues do not have any known function. Peptide libraries could 
then be inserted into the active-site loop or several amino acid 
25 residues in the loop could be substituted with a peptide li- 
brary. A peptide library inserted in the active-site loop of a 
protease inhibitor would very likely contain protease inhibitors 
of several new specificities but, as detailed above, also se- 
quences reactive with other biomolecules in vivo. 

30 By fusing the C-terminal of the truncated CI-2A to the surface 
protein of the bacteriophage pill, it is contemplated to use 
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CI-2A scaffolds for selection by phage display, which would make 
it possible to enrich the library for binders to intracellular 
components before expressing the library intracellularly in 
mammalian cells. In order to direct the CI-2A scaffold to dif- 
5 ferent compartments of the cell retroviral vectors holding 
different leader sequences will be constructed, cf. the above 
discussion . 

To be abie to isolate the Ci-2A scaffold - and thereby Llie 
target molecule to which it binds - from selected cells, a 

10 peptide tag will be engineered to the N-terminus of the trun- 
cated CI-2A. We are currently considering the following tags: 
His-tag, Strep-tag or FLAG-tag. However, for future use we are 
planning to use antibodies with specificity to CI-2A or to a 
peptide sequence of CI-2A. Peptide libraries of different sizes 

15 (six- to nine-mers) would either be inserted between Met40 and 
Glu41 or amino acid residues in the loop would be substituted 
with a peptide library. 



NMLE Bioitch. , CellScr«n2 DK l/PK'l99S-07-20 



68 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: M&E Biotech A/S 

(B) STREET: Kogle Alle 6 

(C) CITY; Horsholm 

(E) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP) : DK-2970 

(G) TELEPHONE: +45 45 16 25 25 

(H) TELEFAX: +45 45 16 25 00 

(ii) TITLE OF INVENTION: Novel method for the identification of ligand 
and target biomolecules 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS- DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hordeum vulgare 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 85 . . 336 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION : 88 . . 336 

(ix) FEATURE: 

(A) NAME/KEY; Duplication 

(B) LOCATION: 24 9 

(D) OTHER INFORMATION: G is G or T 

(ix) FEATURE : 

(A) NAME/KEY: Duplication 

(B) LOCATION: 252 
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{ D) OTHER INFORMATION: A is A or C 

(ix) FEATURE: 

(A) NAME/ KEY : Duplication 

(B) LOCATION: 279 

(D) OTHER INFORMATION: C is C or T 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CATTAAACTG ATGACATGAC AGTTCAAGAT CTCACAGTCA CATCGGCGAT CTAATCAGTC 60 

TCACAGGAAG CGAGCGTAAC AAGG ATG AGT TCA GTG GAG AAG AAG CCG GAG 111 

Met Ser Ser Val Glu Lys Lys Pro Glu 
-11 5 

GGA GTG AAC ACC GGT GCT GGT GAC CGT CAC AAC CTG AAG ACA GAG TGG 159 
Gly Val Asn Thr Gly Ala Gly Asp Arg His Asn Leu Lys Thr Glu Trp 
10 15 20 

CCA GAG TTG GTG GGG AAA TCG GTG GAG GAG GCC AAG AAG GTG ATT CTG 207 
Pro Glu Leu Val Gly Lys Ser Val Glu Glu Ala Lys Lys Val lie Leu 
25 30 35 40 

CAG GAC AAG CCA GAG GCG CAA ATC ATA GTT CTG CCG GTG GGG ACA ATT 2 55 

Gin Asp Lys Pro Glu Ala Gin lie He Val Leu Pro Val Gly Thr He 
45 50 55 

GTG ACC ATG GAA TAT CGG ATC GAC CGC GTC CGC CTC TTT GTC GAT AAA 303 

Val Thr Met Glu Tyr Arg He Asp Arg Val Arg Leu Phe Val Asp Lys 

60 65 70 

CTC GAC AAC ATT GCC CAG GTC CCC AGG GTC GGC TAGCAAGCTT GAGAGCTAGC 356 

Leu Asp Asn He Ala Gin Val Pro Arg Val Gly 
75 80 

CTGCTGCTGG CGTGTATGTA TTGCAGCTTC ACCATCTCTT CTTGGCTATA GCAAGATTGA 416 

GATTTATAAA TCATATACAA TAAGAGTTGC TGCGG 4 51 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Ser Val Glu Lys Lys Pro Glu Gly Val Asn Thr Gly Ala Gly 
-11 5 10 15 

Asp Arg His Asn Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser 
20 25 30 
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Val Glu Glu Ala Lys Lys Val lie 
35 

lie lie Val Leu Pro Val Gly Thr 

50 55 

Asp Arg Val Arg Leu Phe Val Asp 
65 70 

Pro Arg Val Gly 
80 
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Leu Gin Asp Lys Pro Glu Ala Gin 
40 45 

He Val Thr Met Glu Tyr Arg He 
60 

Lys Leu Asp Asn He Ala Gin Val 
75 
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CLAIMS 

1. A method for identifying an in vivo active modulator of 
activity of a target enzyme, the method comprising the steps of 

(a) preparing a pool of expression vectors, each vector of 
5 said pool containing at least one member from a library of 

randomly modified nucleotide sequences derived from a 

r~. * - f~ i ~ +- \ r* j= cr-.^'T^rp r* r\ c c\ H i n a 3 r> 3 r e n t 12* e t> t i d e or 
parent ribonucleic acid which modulates the target enzyme 
activity, 

10 (b) transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 
cells, said substantially identical cells being ones which 
harbour the target enzyme , 

(c) culturing said transformed cells under conditions 

15 facilitating expression of said randomly modified nucleo- 

tide sequences, 

(d) examining said transformed cells and isolating transformed 
cell (s) wherein the activity of the target enzyme is 
modulated, and 

20 (e) determining said randomly modified nucleotide sequence of 
said vector present in cell(s) isolated in step (d) and/or 
determining the amino acid sequence or the ribonucleic 
acid sequence of the expression product encoded by said 
randomly modified nucleotide sequence. 

25 2. The method according to claim 1, wherein the randomly modi- 
fied nucleotide sequences consist of 1) an invariable part of 
the parent nucleotide sequence, and 2) random nucleotides. 
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3. The method according to claim 2, wherein the invariable part 
of the parent nucleotide sequence encodes a scaffold portion of 
the parent peptide or of the parent ribonucleic acid which 
serves to stabilize said polypeptide fragment or ribonucleic 
5 acid fragment. 



4. A method for identifying a modulator in the form of a biolo- 
gically active polypeptide fragment or ribonucleic acid fragment 
which is capable of detectably modulating, in vivo, a phenotypic 
trait in a cell, the method comprising the steps of 



10 (a) preparing a pool of expression vectors, each vector of 

said pool containing at least one member from a library of 
randomly modified nucleotide sequences derived from a 
parent nucleotide sequence encoding a parent peptide or 
parent ribonucleic acid which in vivo modulates activity 

15 of a known enzyme, wherein the randomly modified nucleo- 

tide sequences comprise 

- an invariable part encoding a scaffold portion of 
the parent peptide or of the parent ribonucleic 
acid, said scaffold portion serving to stabilize 

20 said polypeptide fragment or ribonucleic acid frag- 

ment, and 

— random nucleotides, 



(b) transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 
25 cells, 



(c) culturing said transformed cells under conditions 

facilitating expression of said randomly modified nucleo- 
tide sequences , 



(d) examining said transformed cells and isolating transformed 
30 cell(s) wherein the preselected phenotypic trait is modu- 
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lated by the presence of the expressed randomly modified 
nucleotide sequence, and 

(e) determining said randomly modified nucleotide sequence of 
said vector present in cell (s) isolated in step (d) and/or 
5 determining the amino acid sequence or the ribonucleic 

acid sequence of the expression product encoded by said 
randomly modified nucleotide sequence. 

5. The method according to any of the preceding claims wherein 
the substantially identical cells are prokaryotic cells. 

10 6. The method according to claim 5, wherein the prokaryotic 

cells are bacteria selected from the group consisting of Bacil- 
lus spp., Clostridium spp., Corynebact er ium spp., Staphylococcus 
spp., Streptococcus spp., Escherichia coli, Serratia marcescens, 
Klebsiella spp., Proteus spp., Citrobacter spp., Salmonella 

15 spp., Shigella spp., Pseudomonas spp., Acine tobacter spp., 
Aeromonas spp., Plesiomonas spp., Yersinia spp., Francisella 
tularensis, Vibrio spp., Campylobacter spp., Helicobacter pylo- 
ri, Haemophilus spp., Bordetella spp., Brucella spp., Neisseria 
spp., Treponema pallidum, Leptospira interrogans; Borrelia spp., 

20 Legionella pneumophila, Listeria monocytogenes, Mycobacterium 
spp., Treponema pallidum, Chlamydia trachomatis, Actinomyces 
spp., Rickettsia spp., Mycoplasma spp., Lactococcus spp., and 
Lactobacil lus spp . 

7. The method according to claim 6, wherein 
25 the Clostridium spp. is selected from the group consisting of C. 
botulinum, C. difficile, C. perfringens, and C. tetani 
the Corynebacterium spp. is selected from the group consisting 
of C. diphtheriae, and C. pyogenes; 

the Bacillus spp. is selected from the group consisting of B. 
30 anthracis, B. subtilis and B. cereus; 
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the Staphylococcus spp. is selected from the group consisting of 
S. aureus and S. albicans; 

the Streptococcus spp. is selected from the group consisting of 
S. pneumoniae, S. pyogenes, and S. agalactiae; 
5 the Klebsiella spp. is K. pneumoniae; 
the Proteus spp. is P. mirabilis; 

the Citrobacter spp. is Citrobacter freundii;the Salmonella spp. 
is selected from the group consisting of S. typhi, S. typhimuri- 
um f S. shottmulleri and S. paratyphi; 
10 the Shigella spp. is selected from the group consisting of S. 
dystenteriae, S. flexneri, S. boydii, and S. sonnei; 
the Pseudomonas spp. is selected from the group consisting of P. 
aeruginosa, P. pseudomallei , and P. mallei; 

the Yersinia spp. is selected from the group consisting of Y. 
15 pestis, Y. enterocolitica, and Y. pseudotuberculosis; 

the Vibrio spp. is selected from the group consisting of V. 
cholerae and V. parahaemolyt icus ; 

the Campylobacter spp. is selected from the group consisting of 
C. jejuni and C. coli; 
20 the Haemophilus spp. is selected from the group consisting of H. 
influenzea, H. pa rainf luenzae , and H. aegyptius; 

the Bordetella spp. is selected from the group consisting of B. 
pertussis, B. parapertussis, and B. bronchisept ica ; 
the Neisseria spp. is selected from the group consisting of N. 
25 gonnorhoeae and N. meningitidis; 

the Borrelia spp. is selected from the group consisting of B. 
burgdorferi sensu stricto, B. garinii, B. afzelii, and B. 
recurrent is; 

the Mycobacterium spp. is selected from the group consisting of 
30 M. tuberculosis, M. bovis, M. africanum, M. kansasii, and M. 
leprae; and 

the Mycoplasma spp. is M. pneumoniae. 

8. The method according to any one of claims 1-4, wherein the 
35 substantially identical cells are eukaryotic cells. 
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9. The method according to claim 8, wherein the eukaryotic cells 
are selected from the group consisting of fungal cells, proto- 
zoan cells, animal cells, and plant cells. 

10. The method according to claim 9, wherein the fungal cells 
5 are selected from the group consisting of Epidermophyton spp., 

Trichophyton spp., Microsporum spp., Candida albicans, Philo- 
phora spp., Coccidioides immitis, Histoplasma capsulatum, 

Crypt ococcus neoformans , Aspergi 1 lus spp . , Saccharomyces cerevi- 
10 siae, Klyveromyces lactis, and Piccia pastoris. 

11. The method according to claim 9, wherein the protozoan cells 
are selected from the group consisting of Giardia lamblia, 
Trichomonas vaginalis, Dientamoeba fragilis, Trypanosoma spp., 
Leishmania spp. , Entamoeba histolytica, Naegleria fowleri, 

1 5 Acanthamoeba castellani, Harmanella spp . , Isospora bel 1 i , 
Cryptosporidium spp . , Sarcocyst is spp . , Toxoplasma gondi i , 
Plasmodium spp. , Babesia spp. , and Balantidium coli . 

12 . The method according to claim 9 , wherein the plant cells are 
derived f rom a plant selected from the group consisting of 

20 Nicotiana tabacum (tobacco plant), Arabidopsis thaliana, Bras- 
si ca napus , Brassica j uncea, Musa sp . (banana plant ) , rice, and 
corn . 

13. The method according to claim 9, wherein the animal cells 
are selected from the group consisting of mammalian cells, 

25 arthropod cells such as insect cells, avian cells, and piscine 
cells . 

14. A method according to claim 13, wherein the mammalian cells 
are selected from the group consisting of Chinese hamster ovary 
(CHO) cells, VERO cells, HeLa cells, W138 cells, BHK cells, 

30 COS-7 293 cells, and MDCK cells. 
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15. The method according to any of the preceding claims, wherein 
the transformed cells examined in step (d) predominantly carries 
one single copy of the vector. 

16. The method according to claim 15, wherein transformation 
5 step (b) is performed under such conditions that the cells 

transformed are predominantly or at most transformed with one 
single vector from said pool, or wherein, prior to carrying out 
step (d) , cells being transformed with more than one vector from 
said pool are substantially excluded from the further steps. 

10 17. The method according to any of the preceding claims, wherein 
the modulator is a peptide. 

18. The method according to any of claims 1-16, wherein the 
modulator is a nucleic acid fragment such as an RNA fragment. 

19. The method according to any of the preceding claims, wherein 
15 the modulator is stable towards proteolytic attack and/or is 

insensitive to a reducing environment. 

20. The method according to any one of claims 2-19, wherein the 
random nucleotides are introduced in part(s) of the parent 
nucleotide sequence which encode (s) the active site(s) of the 

20 parent peptide or parent ribonucleic acid, or the part(s) which 
encode (s) structure (s) interfering with the active site(s). 

21. The method according to any one of claims 2-20, wherein the 
invariable part of the nucleotide sequence encodes truncated 
parts of the parent peptide or parent ribonucleic acid suffi- 

25 cient to maintain stability. 

22. The method according to any of claims 2-21, wherein the 
invariable part of the parent nucleotide sequence encodes a 
peptide which is free from disulphide bridges. 
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23. The method according to any one of claims 2-21, wherein the 
invariable part of the parent nucleotide sequence encodes a 
peptide having disulphide bridges. 

24. The method according to any one of claims 2-23, wherein the 

5 random nucleotides are introduced in the form of an insertion or 
a substitution into the parent nucleotide sequence, optionally 
in combination with deletion (s) in the parent nucleotide se- 
quence . 

25. The method according to claim 24, wherein the number of 

10 random nucleotides which are introduced is in the range from 3 
to about 100. 

26. The method according to any one of claims 2-25, wherein the 
random nucleotides are nucleotide sequences and/or are single 
random nucleotides introduced at specific sites in the parent 

15 nucleotide sequence. 

27. The method according to any one of claims 2-26, wherein the 
random nucleotides are selected from the group consisting of 

synthetic, completely random deoxyribonucleot ides ; 

synthetic random DNA sequences, wherein limitation on 
20 randomization of some nucleotides is introduced so as to 

limit the number of available sequences and/or to avoid 
undesired stop codons and/or to facilitate introduction of 
post-translational modifications of expressed peptide (s); 

synthetic random DNA sequences as in (1) or (2) coupled to 
25 a sequence encoding a purification tag; and 

CDR encoding nucleotide sequences isolated from a library 
of immune-competent cells raised against an antigen. 
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28. The method according to claim 27, wherein the CDR encoding 
nucleotide sequences encode CDR-3 peptide sequences. 

29. The method according to any one of claims 26-28, wherein the 
random nucleotides are prepared by codon split synthesis where 

5 defined DNA codons are synthesized in a random order. 

30. The method according to claim 29, wherein the relative 
amount of synthesized codons ensure that all encoded amino acids 
will be present with substantially the same frequency in encoded 
polypeptide fragments . 

10 31. The method according to any one of claims 2-30, wherein the 
random nucleotides are introduced into the expression vector by 
the principle of site directed PCR-mediated mutagenesis. 

32. The method according to any one of the preceding claims, 
wherein the modulator in vivo reduces KM of the target enzyme 

15 for at least one substrate. 

33. The method according to any one of claims 1-31, wherein the 
modulator in vivo increases KM of the target enzyme for at least 
one substrate. 

34. The method according to any of the preceding claims, wherein 
20 the modulator in vivo increases Vmax of the target enzyme for at 

least one substrate. 

35. The method according to any of claims 1-33, wherein the 
modulator in vivo reduces Vmax of the target enzyme for at least 
one substrate. 

25 36. The method according to any one of the preceding claims, 
wherein the parent peptide or parent ribonucleic acid is an 
inhibitor of activity of the target enzyme. 
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37. A method according to claim 36, wherein the inhibitor is 
selected from the group consist ing of 

a BPTI/Kunitz family protease inhibitor, a serpin family prote- 
ase inhibitor, a Kazal family protease inhibitor, a soybean 
5 trypsin inhibitor (Kunitz) family protease inhibitor, a potato 
inhibitor I family member, a Bowman-Birk family protease inhibi- 
tor, a squash inhibitor family member, a wap-type ' Four-disul- 
fide Core 1 proteinase inhibitor, a hirudin family protease 

10 family member, a cystatin family protease inhibitor, a calpain 
family cysteine protease inhibitor, a tissue inhibitor of 
metalloproteinases family member, a carboxypept idase A inhibi- 
tor, a metal locarboxypept idase inhibitor, an angio tens in-con- 
vert ing enzyme inhibitor, a cereal alpha- amylase/ trypsin inhibi- 

15 tor family member, an alpha-amylase/t rypsin inhibitor homologous 
to thaumatin, an alpha-amylase/subtilisin inhibitor family 
member, an inhibitors of insect alpha-amylases, an inhibitor of 
mammalian alpha-amylases derived from Streptomyces species, a 
trehalase inhibitor, a polygalacturonase inhibitor, a 

20 fucosyl transferase inhibitor, a protein kinase C inhibitor, 

an cAMP-dependent protein kinase inhibitor, a cyclic nucleotide 
phosphodiesterase inhibitor, a protein phosphatase inhibitor, a 
TCD/MRS6 family GDP dissociation inhibitor, an ATPase inhibitor, 
a phospholipase A2 inhibitory protein, a ribonuclease inhibitor, 

25 an RNA polymerase inhibitor, a DNA-entry nuclease inhibitor, and 
a beta- lactamase inhibitor . 

38. The method according to any of the preceding claims wherein 
the substantially identical cells are mammalian cells and the 
vector is selected from the group consisting of a retroviral 

30 vector, a vaccinia virus vector, an adenoviral vector, an adeno 
associated virus (AAV) vector, a herpes simplex virus (HSV) 
vector, an alpha virus vector, and a semliki forest virus vec- 
tor. 
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39. The method according to claim 38, wherein the vector is 
retroviral . 

40. The method according to claim 39, wherein the retroviral 
vector is derived from retrovirus selected from the group con- 

5 sisting of Avian Leukosis-Sarcoma Virus (ASLV) , Mammalian type 
C, Mammalian type B, and Lentivirus, and optionally modified 
with heterologous cis-acting elements . 

41. The method according to claim 39 or 40, wherein the retrovi- 
ral vector has non-identical ends. 

10 42. The method according to claim 41, wherein the non-identical 
ends contain non-identical promoters. 

43. The method according to any one of claims 39-42, wherein the 
retroviral vector contains heterologous promoter replacing the 
viral promoter in the 5 1 -LTR , such as a CMV promoter, an RSV 

15 promoter, an SV-40 promoter, a TK promoter, an MT promoter, or 
an inducible system such as Tet or Ecdysone. 

44. The method according to any one of claims 39-43, wherein 
step (a) is carried out by 

1) transfecting suitable packaging cells with vectors com- 
20 prising the randomly modified nucleotide sequences and 

which are integratable in virions produced by said pack- 
aging cells, 

2) culturing said transfected packaging cells in a culture 
medium under conditions which facilitates production by 

25 the packaging cells of virions containing the randomly 

modified nucleotide sequences , 



3) 



recovering and optionally concentrating said virions, and 
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4) 



transducing said substantially identical cells with the 



vx r ions . 



45. The method according to claim 44, wherein the packaging 
cells are selected from the group consisting of PE501, Bosc23, 

5 Psi2, GP+E86, PhoenixEco, PA317, GP+AM12, DA(ampho), Bing, 
FLYA13, ProPak, CRIP, PsiAM, Phoenix-Ampho, PG13, and H9 
(293GPG) . 

46. The method according to any of the preceding claims wherein 
the randomly modified nucleotide sequences are coupled to a 

10 nucleotide sequence encoding at least one fusion partner. 

47. The method according to claim 46, wherein the fusion partner 
serves to facilitate expression and/or purification/isolation 
and/or further stabilization of the expression product. 

48. The method according to claim 47, wherein the fusion partner 
15 includes a purification tag such as His6 tag, myc tag, BSP 

biot inylation target sequence, of BirA, flu tag, lacZ, and GST. 

49. The method according to claim 46 or 47, wherein the fusion 
partner is a sorting signal or a targeting sequence. 

50. The method according to claim 45, wherein the sorting signal 
20 is a signal patch or a signal peptide. 

51. The method according to claim 49 or 50, wherein the sorting 
signal effects export of the expressed peptide out of the cell 
or into the cell membrane, or, when the substantially identical 
cells are eukaryotic, into endoplasmic reticulum, into Golgi 

25 apparatus, into lysosomes, into secretory vesicles, into mito- 
chondria, into peroxisomes, or into the nucleus. 



M&E Biotec^CeflScrecfi: DK KTK/1W8-07O0 



82 



52. The method any one of claims 49-51, wherein the sorting 
signal or targeting sequence is selected from the group consis- 

ting of 

a nuclear localization signal (NLS) such as Pro-Lys-Lys-Lys-Arg- 

Lys-Val (SV40 large T antigen NLS), Ala-Arg-Arg-Arg-Arg-Pro 
(human retinoic acid receptor-p NLS), Glu-Glu-Val-Gln-Arg-Lys- 

Arg-Gln-Lys-Leu (NFkB P 50), Glu-Glu-Lys-Arg-Lys-Arg-Thr-Tyr-Glu 
(NFkB P 65), and Ala-Val-Lys-Arg-Pro-Ala-Ala-Thr-Lys-Lys-Ala-Gly- 

Gln-Ala-Lys-Lys-Lys-Lys-Leu-Asp (Xenopus nucleoplasmin NLS) ; 
a membrane anchoring sequence such as those derived from CDS, 
ICAM-2, IL-8, CD4, and LFA-1 , and a lipidation sequence such as 
a myristylation or a palmitoylation sequence; 
a lysosomal sorting signal such as a lysosomal degradation 
sequence, and a lysosomal membrane sequencea mitochondrial 
localization sequence such as a mitochondrial matrix sequence, a 
mitochondrial inner membrane sequence, a mitochondrial intermem- 
brane space sequence, and a mitochondrial outer membrane se- 
quence; an endoplasmic reticulum localization sequence such as 
the sequence from calreticulin (KDEL) and the sequence from 
adenovirus E3/19K protein (LULSRRSFIDEKKMP) ; 

a peroxisome sequence such as the peroxisome matrix sequence 
from Luc i f erase ; 

a farnesylation sequence such as LNPPDESGPGCMSCKCVLS ; 
a geranylgeranylation sequence such as LTEPTQPTRNQCCSN ; 
a destruction sequence such as RTALGDIGN; and 

a secretory signal sequence such as the secretory signals from 
IL-2, growth hormone, preproinsulin, and influenza HA protein. 

53. The method according to any one of the preceding claims, 
which further comprises the step of resolving the 3-dimensional 
structure of the identified modulator. 

54. A method for the preparation of a replicable expression 
vector, the method comprising the steps of identifying a modula- 
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tor by the method according to any one of claims 1-53, and sub- 

sequent 1 y 

i) isolating or synthesizing a nucleic acid sequence which 

encodes the modulator, and 
5 ii) engineering a replicable expression vector comprising an 
operon which comprises, in the 5'-3' direction and in 
operable linkage, 1) a promoter for driving expression of 
the nucleic acid sequence, 2) optionally a nucleotide 
sequence encoding a leader peptide, 3) the nucleic acid 
1Q sequence, and 4) a termination signal. 

55. The method according to claim 54, wherein the promoter is 

inducible . 

56. The method according to claim 55, wherein the promoter is 
selected from the group consisting of a bacterial promoter, a 

15 fungal promoter such as a yeast promoter, and a mammalian pro- 
moter . 

57. A method according to any of claims 54-56, wherein the 
expression vector is a plasmid, a phage, a cosmid, a minichromo- 
some, or a virus. 

20 58. A method according to any of claims 54-57, wherein the 

expression vector is capable of being integrated into the genome 
of a suitable host cell. 

59. A method according to any of claims 54-58, wherein the 
nucleotide sequence is incorporated into a vector selected from 

25 the group consisting of P BR322, phage-A and YRp7 . 

60. A method for the preparation of a transformed cell carrying 
a nucleic acid sequence encoding a modulator as defined in any 
one of claims 1-53, the method comprising transforming a suit- 
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able host cell with an expression vector prepared according to 
any one of claims 54-59. 

61. A method for providing a modulator as defined in any one of 
claims 1-53, the method comprising 

5 I) growing a transformed cell prepared according to the 

method of claim 60 in a culture medium under conditions 
which facilitates expression by the cell of the randomly 
modified nucleotide sequence, and 
II) subsequently harvesting the expression product from the 
10 cell and/or the culture medium, or 

la) identifying the modulator according to the method of any 

one of claims 1-53, and 
lb) subsequently synthesizing the modulator by means of chemi- 
cal synthesis on the basis of the sequence determined in 
15 step (e) . 

62. A method for isolating a target biomolecule, the method 
comprising providing a modulator according to the method of 
claim 61 and subsequently using the modulator as an affinity 
ligand in an affinity purification step so as to isolate the 

20 target biomolecule from a suitable sample. 

63. The method according to claim 62, wherein the affinity 
purification step is an affinity chromatographic step, an affi- 
nity mass spectrometry step, or a co-immunoprecipitat ion step. 

64. A method for isolating a target biomolecule, the method 

25 comprising providing a peptide modulator according to the method 
of claim 61 and subsequently using the modulator as a probe 
against a cDNA library derived from the substantially identical 
cells or using the modulator as bait in a two- or three-hybrid 
system. 
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65. The method according to any of claims 61-63, wherein the 
target biomolecule is a peptide or a nucleic acid. 

66. The method according to claim 64 or 65 further comprising 
the step of determining the amino acid sequence of the peptide 

5 or determining the nucleotide sequence of the nucleic acid. 

67. The method according to claim 66, further comprising the 
step ot resolving che 3— dimensional structure of the target 
biomolecule . 

68. A method for selecting a chemical compound as a lead com- 
10 pound in drug development, the method comprising the steps of 

- assaying a library of chemical compounds for interaction 

with a target biomolecule which has been de novo isolated 
according to the method of any one of claims 62-61, and 
selecting compounds which interact significantly with the 
15 target biomolecule . 

69. The method according to claim 68, wherein the library of 
chemical compounds has been provided by chemical synthesis upon 
initial identification of the members of the library by 
structure-based or non-structure based computer drug-modelling. 

20 70. A method for the preparation of a medicinal product, the 
method comprising the steps of 

A) selecting a lead compound by the method according to claim 
68 or 69, 

B) performing pre-clinical tests with the lead compound in 
25 order to assess the suitability thereof as a medicinal 

product , 

C) entering, if the lead compound is deemed suitable in step 
(B), clinical trials using the lead compound in order to 
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obtain market authorization for a medicinal product in- 
cluding the lead compound as a pharmaceut ical ly active 
substance, and 

D) upon grant of a market authorization, admixing the lead 

compound with a pharmaceut ical ly acceptable carrier, 
excipient or diluent and marketing the thus obtained 
medicinal product . 
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